Staff Site Reliability Engineer
**About Us**:
SentinelOne is defining the future of cybersecurity through our XDR platform that automatically prevents, detects, and responds to threats in real-time. Singularity XDR ingests data and leverages our patented AI models to deliver autonomous protection. With SentinelOne, organizations gain full transparency into everything happening across the network at machine speed - to defeat every attack, at every stage of the threat lifecycle.
We are a values-driven team where names are known, results are rewarded, and friendships are formed. Trust, accountability, relentlessness, ingenuity, and OneSentinel define the pillars of our collaborative and unified global culture. Were looking for people that will drive team success and collaboration across SentinelOne. If youre enthusiastic about innovative approaches to problem-solving, we would love to speak with you about joining our team!
**What are we looking for?**:
We are looking for Staff SRE with prior extensive operations experience for a SaaS product, who can drive deployment re-architecture with focus on self-service and automation. Someone who has delivered SaaS products on multi-cloud, on-prem and air gapped environments, driven continuous delivery of software, has run incident post-mortems, has provided feedback to engineering architecture decisions and has automated repetitive operational tasks.
You will join a like minded team of SREs who help run our operations smoothly at scale by building a platform on which S1s services can run. If the thought of running a large scale cybersecurity platform on various cloud providers, on-prem and air gapped environments excite you, youve found the right place!
As a team we value good written communication skills, data driven decisions and a keen eye for continuous improvements. Youll help simplify, have a passion for new ideas and know how to execute iteratively towards the final goal. We value candor and collaboration.
**What will you do?**:
SRE organizations mission at SentinelOne (S1) is to keep our uptime promise to our customers by ensuring we meet our SLOs/SLAs, help our engineering teams ship software to our customers fast and with quality and ensure our customers are successful.
- In this job as Staff SRE / Tech Lead, you will join the Core SRE team at S1 and have an amazing opportunity to drive outcomes that improve reliability, stability and cost efficiency of S1s Singularity Platform - our largest customer facing service, which has over 11,000 B2B/B2G customers deployed across over 5 regions and 2 cloud service providers.
- **Big projects** that are upcoming that you could work on include e.g.: Monitoring and Observability Uplift, Logging Pipeline modernization and more!
**Your tools**: Git, ArgoCD, Jenkins, Ansible, Kubernetes, Docker, Kafka, AWS, GCP, Terraform
**What experience & skills should you bring?**:
- Multiple years of experience in running site reliability for SaaS products, running operations at a large scale and extensive experience in leading design and architecture of infrastructure (cloud and on-prem combined)
- Multi-cloud experience, deep expertise with at least one of AWS/GCP/Azure platforms
- Extensive production experience with orchestration systems like Kubernetes, Nomad or Mesos (We are a Kubernetes shop),
- Any experience with Rancher, Platform9 or other managed k8s providers is desired
- Familiarity with on-prem and air gapped deployments on top of k8s
- Demonstrated experience with Kafka and Redis
- Familiar with IaaC and tools (Terraform or Pulumi)
- Familiarity with CI and practical delivery using any of the major tools, familiarity with deployment strategies like blue green, rolling deploys, canary deploys and best practices around deployment automation (with tools like shipit or spinnaker) is desired
- Demonstrated Proficiency in at least 1 mainstream language (Python/GoLang/Ruby/etc)
- Familiarity with SecOps & Compliance processes and their touch points with SRE is desired
- Polyglot experience with other SRE tools - we integrate with more tools every day
- Keeping a pulse on latest SRE trends and Open Source
- Prior product building experience
**Apart from the above technical skills, following soft skills are required**:
- Curiosity, fast-learning, pursuit to improvements, great communication
- Ability to work in a diverse and distributed team
- A self-starter that is passionate and motivated by new technologies and has empathy for legacy systems
- A quick learner that can navigate through unfamiliar programming languages, systems and processes
**Why us?**:
You will work on real-world problems and make an impact by protecting our customers from cyber threats. You will join a cutting-edge project and will be able to influence the architecture, design and structure of our core platform. You will tackle extraordinary challenges and work with the very BEST in the industry.
- Flexible working hours, In Prague & nearby were working in a hybrid m
💡 Doporučuji: Vytvořte si svůj profesionální životopis (zdarma a snadno), se kterým zvýšíte šanci na získání lepší práce.
💡 Podívejte se na video 6 tipů pro životopis, díky kterým získáte pozvánku na pohovor, které Vám pomůže s přípravou životopisu a motivačního dopisu pro zvýšení šancí na pozvání na pohovor.
Zajímavé nabídky práce v okolí:
Práce Staff Site Reliability Engineer: Často kladené otázky
👉 V jakém městě se nabízí nabídka práce Staff Site Reliability Engineer?
Práce je nabízena v lokalitě Praha.
👉 Jaká firma nabírá na tuto pozici?
Tato nabídka práce je do firmy SentinelOne.