At WHOOP, we're on a mission to unlock human performance and healthspan. WHOOP empowers members to perform at a higher level and live longer through a deeper understanding of their bodies and daily lives. Protecting our members’ data and ensuring our systems scale securely and reliably is core to this mission.
The Application Infrastructure team enables iOS, Android, Backend, and Web engineers to reliably and quickly deliver new features to members. We own and operate critical shared infrastructure, including Kubernetes and Kafka, and build the tooling and guardrails that allow teams across WHOOP to ship safely at high velocity.
As a Senior DevOps Engineer on the AppInfra team, you will play a key role in driving large-scale architecture projects, collaborating with cross-functional teams to design and implement infrastructure that is highly scalable, resilient, and secure. You will build systems that help increase the safety of deployments to Kubernetes while also increasing the velocity of releasing new features. You will be responsible for ensuring optimal performance and stability for the entire software delivery pipeline. You will help set the direction of the AppInfra team to impact engineers across all WHOOP technology stacks.
RESPONSIBILITIES
Design, develop, and operate WHOOP’s Kubernetes clusters running on AWS infrastructureDrive architectural decisions to improve scalability, resiliency, performance, and security across the build and deployment platformBuild systems and tooling that increase deployment safety and accelerate release velocity to KubernetesAdvance CI/CD capabilities to support frequent, reliable production deploymentsLead developer productivity improvements through tooling, automation, and platform integrationsPartner with application, security, and data teams to embed secure-by-default infrastructure practicesParticipate in incident response, root cause analysis, and postmortems to continuously improve platform reliabilityMentor and provide technical leadership to engineers on the Application Infrastructure teamHelp define and execute the long-term roadmap for infrastructure and Kubernetes management at WHOOP
QUALIFICATIONS:
5+ years of experience in DevOps, Platform, Site Reliability, CloudEngineering, or Backend Software Engineering rolesDeep understanding of Kubernetes architecture and core componentsStrong knowledge of container networking concepts, including overlay networking, service meshes, and network policiesExperience with multi-cluster Kubernetes environments and inter-cluster communication patternsHands-on experience operating cloud infrastructure, preferably in AWS (e.g., IAM, VPC, EC2, S3, RDS, CloudTrail, Organizations)Hands-on experience with Infrastructure as Code tools (e.g. Terraform)Experience developing backend or infrastructure-adjacent services using Java, C#, or PythonProven ability to evaluate system performance, identify bottlenecks, and use data to drive improvementsExperience collaborating with multiple stakeholders and prioritizing work for maximum business impact
BONUS QUALIFICATIONS:
Experience operating Kafka or other large-scale distributed systemsExperience with Kubernetes security best practices, including RBAC, secrets management, and pod security standardsExposure to service reliability practices such as SLOs, SLIs, and error budgetsPrior experience supporting compliance or security-focused infrastructure initiatives
ABOUT YOU:
You bring a security-first mindset to everything you build and operateYou enjoy working on infrastructure that enables hundreds of engineers to move fasterYou’re comfortable operating in complex, high-scale production environmentsYou enjoy teaching, mentoring, and raising the technical bar for those around youYou’re curious, adaptable, and excited to learn across a wide range of systems and technologies