E-Space

Site Reliability Engineer (SRE) / DevOps Engineer

E-Space • US
Ready to make connectivity from space universally accessible, secure, and actionable? Then you’ve come to the right place!

At E-Space, we’re focused on bridging Earth and space with the world’s most sustainable low Earth orbit (LEO) satellite network. We’re a team of bold thinkers, ambitious leaders and dynamic doers—and we’re disrupting NewSpace by fundamentally changing the design of legacy LEO space systems to deliver entirely new satellite capabilities at a fraction of the cost.

We’re intentional, we’re unapologetically curious and we’re 100% committed—to saving space, to protecting our planet and to turning connectivity into actionable intelligence.

What you will be doing:

  • Design, deploy, and maintain highly scalable, highly available software systems in AWS
  • Architect and manage containerized applications on Amazon EKS with a focus on reliability and performance
  • Build and maintain Infrastructure as Code using Terraform for AWS cloud resources
  • Develop and optimize CI/CD pipelines for automated testing, deployment, and rollback capabilities
  • Implement comprehensive monitoring, alerting, and observability solutions using CloudWatch, Prometheus, and Grafana
  • Ensure system reliability through SLI/SLO definition, error budgets, and incident response procedures
  • Collaborate directly with engineering teams to optimize application deployment and operations
  • Manage deployments and scaling strategies to support mission-critical operations
  • Automate and enforce cloud security, governance, and compliance controls
  • Participate in the on-call rotation and lead incident response for production-level systems

What you bring to this role:

  • 5+ years of experience in SRE, DevOps, or Platform Engineering roles
  • Proven experience designing and operating mission-critical, highly available systems within AWS
  • Advanced proficiency in Infrastructure as Code using Terraform (OpenTofu)
  • Deep experience with Kubernetes, EKS, Helm, and container orchestration
  • Strong CI/CD pipeline development and management experience (Bitbucket preferred)
  • Proficiency in Python and Bash scripting for automation
  • Experience with monitoring and observability tools (Prometheus, Grafana, ELK Stack)
  • Knowledge of capacity planning and performance optimization
  • Experience with database operations and scaling (RDS, Aurora, or similar)

Extra bonus points for the following:

  • AWS Solutions Architect Professional, Certified Kubernetes Administrator (CKA), or equivalent expertise
  • Experience with incident management and post-mortem processes
  • Experience with GitOps workflows and tools (ArgoCD, Flux)
  • Knowledge of service mesh technologies (Istio, Linkerd)
  • Experience with chaos engineering and disaster recovery planning
  • Experience with Zero Trust Network Access (ZTNA) or VPN solutions
  • Background in aerospace, defense, or other mission-critical industries
  • Strong intellectual curiosity and commitment to continuous learning
  • Exceptional attention to detail and an ownership mentality