Jobgether

Senior Site Reliability Engineer (DevTools)

Jobgether • IT
GoJavaPythonKotlin Remote

This position is listed on behalf of a partner company, who manages all applications and next steps. Our partner is looking for a Senior Site Reliability Engineer (DevTools) based in Italy.

This is an exciting opportunity to join a highly technical engineering environment focused on building and operating large-scale developer infrastructure. In this role, you will help maintain, optimize, and evolve critical development platforms that support thousands of daily builds, large-scale source code repositories, and extensive artifact storage systems. Working at the intersection of software engineering and site reliability, you will contribute to resilient, self-healing architectures while improving developer productivity and user experience. You will collaborate with talented engineers, solve complex infrastructure challenges, and leverage modern technologies, including AI-powered tools and automation. The position offers significant ownership, technical depth, and the chance to influence the future of engineering productivity at scale.

Accountabilities

  • Design, operate, and continuously improve large-scale developer infrastructure and internal tooling platforms.
  • Build and maintain reliable, fault-tolerant, and self-healing systems that ensure high availability and performance.
  • Analyze user feedback, identify pain points, and implement solutions that enhance developer experience and productivity.
  • Optimize system performance, reduce operational friction, and improve the efficiency of development workflows.
  • Develop, customize, and extend both open-source and commercial tools to better meet organizational needs.
  • Contribute to software development initiatives across multiple programming languages and technology stacks.
  • Monitor platform health, troubleshoot incidents, and implement preventive measures to improve reliability.
  • Collaborate with engineering teams to define meaningful operational metrics and validate improvements through measurable outcomes.
  • Support users by resolving technical issues, providing guidance, and ensuring platform stability.
  • Explore and integrate emerging technologies, including AI-assisted workflows and developer productivity solutions.
  • Requirements

    • Proven experience combining Site Reliability Engineering and Software Engineering responsibilities in production environments.
    • Strong programming skills and hands-on development experience with languages such as Java, Kotlin, Go, Python, Ruby, or similar.
    • Solid understanding of Unix/Linux operating systems, system internals, and infrastructure troubleshooting.
    • Strong knowledge of JVM-based applications, performance optimization, and operational best practices.
    • Experience designing, operating, and improving highly available and scalable systems.
    • Passion for enhancing user experience through engineering excellence and continuous improvement.
    • Ability to adapt quickly, solve complex technical problems, and perform effectively in fast-changing environments.
    • Strong analytical thinking, troubleshooting capabilities, and attention to detail.
    • Excellent communication and collaboration skills within cross-functional engineering teams.
    • Experience in Platform Engineering, developer platforms, or internal tooling environments is highly valued.
    • Familiarity with version control systems, CI/CD platforms, and build infrastructure such as GitLab, TeamCity, or equivalent solutions is advantageous.
    • Experience with Spring Framework, Java-based monolithic applications, or large-scale enterprise systems is considered a plus.
    • Comfortable participating in technical assessments and coding interviews as part of the hiring process.
    • Benefits

      • Competitive compensation package.
      • Career development, continuous learning, and professional growth opportunities.
      • Flexible working arrangements that support work-life balance.
      • Opportunity to work on innovative and impactful AI-driven technologies and infrastructure.
      • Collaborative, inclusive, and engineering-focused culture.
      • Exposure to complex technical challenges at significant scale.
      • International work environment with highly skilled and diverse teams.
      • High levels of ownership, autonomy, and influence over technical decisions.
      • Opportunity to contribute to the future of developer platforms and cloud technologies.
      • Dynamic environment that encourages innovation, bold thinking, and continuous improvement.
      • Equal opportunity workplace committed to diversity, inclusion, and fair employment practices.