SRE Engineering
Pnlfinancials LLC • NL
We are looking for a Senior/Lead SRE Engineer to drive the design, implementation, and evolution of a Kubernetes-based platform in a multi-cloud environment (GCP/AWS).
This role requires strong ownership of reliability, scalability, and platform architecture for high-load, mission-critical systems operating 24/7.
What You Will Be Doing
Lead the design and operation of the Kubernetes platform (GKE, multi-cluster)
Own and evolve CI/CD and GitOps practices (GitLab CI, ArgoCD)
Define and implement observability strategy (metrics, logs, tracing)
Design scalable, fault-tolerant, and highly available systems
Drive infrastructure automation (Terraform)
Partner with engineering teams to improve system reliability and performance
Establish SRE best practices (SLOs, SLAs, error budgets)
Tech stack: GCP, AWS, Kubernetes, GitLab CI, ArgoCD, Terraform, Prometheus/Grafana/VictoriaMetrics, Cloud Logging, Kafka, Vault, PostgreSQL, Redis, RabbitMQ, OpenTelemetry
Who You Are
Strong hands-on experience with Kubernetes in production (including multi-cluster setups)
Proven experience with high-load systems and large-scale infrastructure
Experience designing and operating highly available 24/7 systems
Strong experience in system scaling, performance optimization, and fault tolerance
Deep knowledge of observability (Prometheus, logging, tracing)
Solid experience with CI/CD and GitOps approaches
Strong experience with cloud platforms (GCP preferred, AWS a plus)
Proven experience with Infrastructure as Code (Terraform)
Experience leading initiatives or mentoring engineers
Strong collaboration skills with engineering teams
English level B2+
Nice to have
Understanding of security and compliance standards and regulations (ISO 27001, PCI DSS, GDPR)