Almedia

Staff Site Reliability Engineer / DevOps

Almedia • Berlin, Berlin, Germany
Remote

This isn’t your regular job. Almedia is a place where those who want to push harder can accelerate their careers faster than anywhere else. We’re aiming to become Germany’s second bootstrapped unicorn. Almedia is already Europe’s #3 fastest-growing company in 2025 (FT1000).

We are building the future of marketing by rewarding our community of over 50 million users for engaging with our advertisers’ products. This gives the biggest companies in the world a new way to acquire users.

At Almedia, you’ll:

  • Own way more, way earlier: you’ll be trusted with responsibility fast.

  • Push harder, get further: this isn’t a 9–5. We highly reward intensity.

  • Join a rare environment: you will work with ambitious, high-speed, high-ownership people.

  • Be fully present: we’re in the office 5 days a week to build the energising momentum we need.

📍 Berlin (preferred) or Remote

About you

  • An SRE or DevOps engineer with hands-on experience in high-traffic production systems

  • Strong in Linux, databases (MySQL, Postgres, MongoDB, Redis), and networking fundamentals

  • Comfortable with Kubernetes, CI/CD pipelines, and observability tools like Datadog

  • A self-starter who thrives in scaling environments and can work independently without PMs

  • Pragmatic, able to balance prevention, maintenance, and firefighting when needed

Your mission is to

  • Take ownership of uptime and reliability for a platform serving 50M+ users

  • Build robust monitoring, alerting, and incident response practices

  • Improve CI/CD pipelines and enable safe deployments (blue-green, canary)

  • Partner with engineers across teams to fix pain points in infra, tooling, and reliability

  • Propose and drive initiatives that use automation to make the platform reliable, cost-efficient, and scalable

Your impact

  • Collaborate with engineering teams to improve operational workflows and resilience

  • Design smart alerts, improve observability, and drive better performance monitoring

  • Lead incident response, including on-call, and drive improvements through blameless postmortems

  • Build safer delivery methods and improve deployments with Kubernetes and GitLab pipelines

  • Report directly to the CTO and act as the primary reliability leader in the company

Your toolkit

  • Linux, networking (TCP/IP), and distributed systems troubleshooting

  • Databases: MySQL, Postgres, MongoDB, Redis

  • Kubernetes, GitLab pipelines, CI/CD best practices

  • Observability tools like Datadog, OpenTelemetry, or ELK stack

  • Nice-to-haves: RabbitMQ, Kafka, Terraform, Ansible, GCP

What makes this role exciting

  • Be the first senior SRE hire with ownership of reliability across the entire platform

  • Shape infrastructure and processes for a scale-up growing beyond 100 FTE

  • Work on a product serving millions of users worldwide with real engineering challenges

  • Gain autonomy while collaborating with strong product and engineering teams

  • Join a culture that values pragmatism, initiative, and continuous improvement

Why Almedia?

  • Own Our Growth: We offer all Berlin-based employees equity in Almedia to truly be a part of our success.

  • Scale With Almedia: Grow alongside a startup that has been profitable from day one.

  • Central Berlin Office: Work from a fully-stocked modern office built for collaboration, accessible from all around Berlin.

  • Other Benefits: Transport subsidy, breakfasts and lunches, language learning, Urban Sports Club, and more.

  • We Listen: We regularly expand our benefits based on employee feedback.

We believe in fostering talent, evaluating all skill levels during the hiring process, and providing a clear path for growth. Almedia is an equal opportunity employer. We embrace and celebrate diversity, and encourage individuals from all backgrounds to apply.