Site Reliability Engineer (SRE) / DevOps Engineer

E-Space · US · 271d ago

Python

About the role

Ready to make connectivity from space universally accessible, secure, and actionable? Then you’ve come to the right place!

At E-Space, we’re focused on bridging Earth and space with the world’s most sustainable low Earth orbit (LEO) satellite network. We’re a team of bold thinkers, ambitious leaders and dynamic doers—and we’re disrupting NewSpace by fundamentally changing the design of legacy LEO space systems to deliver entirely new satellite capabilities at a fraction of the cost.

We’re intentional, we’re unapologetically curious and we’re 100% committed—to saving space, to protecting our planet and to turning connectivity into actionable intelligence.

What you will be doing:

Design, deploy, and maintain highly-scalable, highly-available software systems in AWS

Architect and manage containerized applications on Amazon EKS with focus on reliability and performance

Build and maintain Infrastructure as Code using Terraform for AWS cloud resources

Develop and optimize CI/CD pipelines for automated testing, deployment, and rollback capabilities

Implement comprehensive monitoring, alerting, and observability solutions using CloudWatch, Prometheus, and Grafana

Ensure system reliability through SLI/SLO definition, error budgets, and incident response procedures

Collaborate directly with engineering teams to optimize application deployment and operations

Manage deployments and scaling strategies to support mission-critical operations

Automate and enforce cloud security, governance, and compliance controls

Participate in on-call rotation and lead incident response for production level systems

What you bring to this role:

5+ years of experience in SRE, DevOps, or Platform Engineering roles

Proven experience designing and operating mission-critical, highly-available systems within AWS

Advanced proficiency in Infrastructure as Code using Terraform (OpenTofu)

Deep experience with Kubernetes, EKS, Helm, and container orchestration

Strong CI/CD pipeline development and management experience (Bitbucket preferred)

Proficiency in Python and Bash scripting for automation

Experience with monitoring and observability tools (Prometheus, Grafana, ELK Stack)

Knowledge of capacity planning and performance optimization

Experience with database operations and scaling (RDS, Aurora, or similar)

Extra bonus points for the following:

AWS Solutions Architect Professional, Certified Kubernetes Administrator (CKA), or equivalent expertise

Experience with incident management and post-mortem processes

Experience with GitOps workflows and tools (ArgoCD, Flux)

Knowledge of service mesh technologies (Istio, Linkerd)

Experience with chaos engineering and disaster recovery planning

Experience with Zero Trust Networking (ZTNA) or VPN solutions

Background in aerospace, defense, or other mission-critical industries

Strong intellectual curiosity and commitment to continuous learning

Exceptional attention to detail and an ownership mentality

Tech stack

Python

Location US

Posted 271d ago

findatechjob

Tech jobs straight from company career pages. No recruiters, no middlemen, no spam.

Countries

United States United Kingdom Germany Canada

Languages

Python TypeScript Go Rust

Company

About