Staff Site Reliability Engineer / DevOps
Almedia • Berlin, Berlin, GermanyThis isn’t your regular job. Almedia is a place where those who want to push harder can accelerate their careers faster than anywhere else. We’re aiming to become Germany’s second bootstrapped unicorn. Almedia is already Europe’s #3 fastest-growing company in 2025 (FT1000).
We are building the future of marketing by rewarding our community of over 50 million users for engaging with our advertisers’ products. We are offering a new way to acquire users for the biggest companies in the world.
At Almedia, you’ll:
Own way more, way earlier — you’ll be trusted with responsibility fast.
Push harder, get further — this isn’t a 9–5. We highly reward intensity.
Join a rare environment — you will work with ambitious high-speed, high-ownership people.
Fully present — we’re 5 days a week in the office to build the energising momentum we need.
Staff Site Reliability Engineer / DevOps
📍 Berlin (preferred) or Remote
About you
An SRE or DevOps engineer with hands-on experience in high-traffic production systems
Strong in Linux, databases (MySQL, Postgres, MongoDB, Redis), and networking fundamentals
Comfortable with Kubernetes, CI/CD pipelines, and observability tools like Datadog
A self-starter who thrives in scaling environments and can work independently without PMs
Pragmatic, able to balance prevention, maintenance, and firefighting when needed
Your mission is to
Take ownership of uptime and reliability for a platform serving 50M+ users
Build robust monitoring, alerting, and incident response practices
Improve CI/CD pipelines and enable safe deployments (blue-green, canary)
Partner with engineers across teams to fix pain points in infra, tooling, and reliability
Bring initiatives that make the platform automatically reliable, cost-efficient, and scalable
Your impact
Collaborate with engineering teams to improve operational workflows and resilience
Design smart alerts, improve observability, and drive better performance monitoring
Lead incident response, including on-call, and drive improvement with blameless postmortems
Build safer delivery methods and improve deployments with Kubernetes and GitLab pipelines
Report directly to the CTO and act as the primary reliability leader in the company
Your toolkit
Linux, networking (TCP/IP), and distributed systems troubleshooting
Databases: MySQL, Postgres, MongoDB, Redis
Kubernetes, GitLab pipelines, CI/CD best practices
Observability tools like Datadog, OpenTelemetry, or ELK stack
Nice-to-haves: RabbitMQ, Kafka, Terraform, Ansible, GCP, Datadog
What makes this role exciting
Be the first senior SRE hire with ownership of reliability across the entire platform
Shape infrastructure and processes for a scale-up growing beyond 100 FTE
Work on a product serving millions of users worldwide with real engineering challenges
Gain autonomy while collaborating with strong product and engineering teams
Join a culture that values pragmatism, initiative, and continuous improvement
Why Almedia?
Own Our Growth: We offer all Berlin-based employees equity in Almedia to truly be a part of our success.
Scale With Almedia: Grow alongside a startup that has been profitable from day one.
Central Berlin Office: Work from a fully-stocked modern office built for collaboration, accessible from all around Berlin.
Other Benefits: Transport subsidy, breakfasts and lunches, language learning, Urban Sports Club, and more.
We Listen: We regularly add to our benefits through rigorous employee feedback.
We believe in fostering talent, evaluating all skill levels during the hiring process, and providing a clear path for growth. Almedia is an equal opportunity employer. We embrace and celebrate diversity, and encourage individuals from all backgrounds to apply.