This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Staff Software Engineer - Grafana Cloud Observability, Kubernetes Monitoring in United States.
This role offers a unique opportunity to shape and advance cloud observability solutions for large-scale systems, focusing on metrics, logs, and traces. You will work on developing and maintaining the backend for observability services, including Kubernetes monitoring, database observability, and cloud infrastructure metrics. The position emphasizes technical leadership, cross-team collaboration, and hands-on contribution to scalable software systems. You will also engage with open-source communities, contributing to projects that enhance observability standards globally. Ideal candidates are experienced engineers who thrive in remote, autonomous environments and are passionate about building high-quality, reliable systems that help customers monitor and optimize their infrastructure. This is a chance to influence technical strategy while mentoring team members and delivering impactful solutions.
Accountabilities:
Design, implement, and maintain scalable integrations for metrics, logs, and traces across cloud and Kubernetes environments.Build middleware, libraries, and services to simplify development and observability workflows.Lead technical direction and strategic planning for observability projects.Collaborate with product, support, and sales teams to ensure holistic, high-quality customer experiences.Contribute to open-source projects and represent the team in relevant technical forums.Mentor team members, review code, and enforce engineering best practices.Take ownership of production systems, ensuring reliability, scalability, and maintainability.Requirements:
8+ years of experience in software engineering with strong programming skills (Python, Java, Go, Rust, .NET, or similar).Hands-on experience operating and monitoring high-scale production systems on Kubernetes, including on-call responsibilities and incident management.Familiarity with observability tooling and concepts, including Grafana, Prometheus, Loki, Tempo, and OpenTelemetry.Deep understanding of distributed systems, time-series data, scalability, consistency, and high availability.Proven track record in technical leadership, guiding architectural decisions, and delivering projects end-to-end.Strong problem-solving, debugging, and mentoring skills.Excellent communication skills and ability to thrive in a fully remote, collaborative environment.Bonus: experience with Prometheus in multi-tenant environments, Kubernetes operators, CKA/CKAD certification, and open-source contributions in observability.Benefits:
Competitive US-based salary range: $174,986 - $209,983 USD, plus Restricted Stock Units (RSUs).100% remote work with a global, autonomous culture.Significant career growth opportunities within technical leadership pathways.Generous annual leave policy (30 days) including company-wide shutdown days.Access to modern AI-assisted development tools and frontier models for productivity.Contribution to open-source projects and engagement with a global engineering community.Transparent and collaborative organizational culture with approachable leadership.