Grasshopper

Site Reliability Engineer

Grasshopper • Singapore
Python

About Grasshopper

Grasshopper is a quantitative trading technology provider based in Singapore, and is the holding company of Grasshopper Asset Management. Our state-of-the-art technology, built from the ground up in-house, puts us at the forefront of developments in electronic trading. An unbroken record of consistency and profitability is underpinned by firm values of curiosity, empowerment and flexibility.

About the Role:

As a Site Reliability Engineer on the Infrastructure Team, you will play a key role in strengthening the reliability, scalability, and operational efficiency of our platform. You will work closely with cross-functional teams to design, build, and operate robust systems across our Google Cloud and on-premise infrastructure, with a focus on observability, automation, and production stability.

As a key member of the Infrastructure Team, you’ll:

  • Design, implement, and maintain robust observability systems, including monitoring, logging, tracing and alerting, to ensure high availability, rapid incident detection, and deep system visibility across all services.
  • Architect, develop, and maintain scalable solutions on Google Cloud and on-premise infrastructure.
  • Advance and support our research platform capabilities.
  • Investigate infrastructure and application issues on a live production system.
  • Work with developers to improve our development environment, including CI/CD and build tools.
  • Help drive an SRE mindset within the organisation.

We’d love for you to have:

  • 3–5 years of hands-on experience in Platform, SRE, or Infrastructure Engineering.
  • Experience working in a trading, research, or compute-intensive environment (e.g., research platforms, backtesting systems, large-scale batch processing, or AI/ML workloads) is preferred.
  • Solid engineering fundamentals in Linux, systems, networking, debugging, and distributed systems.
  • Strong problem-solving and analytical skills, with a structured approach to troubleshooting and root cause analysis.
  • Practical experience operating and supporting Kubernetes-based systems in production.
  • Familiarity with GitOps workflows, using tools such as Argo CD and CI/CD platforms (e.g., GitLab CI).
  • Good understanding of cloud infrastructure (GCP or AWS), including deployment, scaling, and basic networking concepts.
  • Programming experience in Python or Go, with a focus on automation and reliability tooling.
  • Strong collaboration and communication skills, with attention to detail.
  • Self-motivated, adaptable, and comfortable working in a fast-moving technical environment.
  • Curiosity and willingness to learn new systems, tools, and technologies.

Prior experience with any of the following would be highly useful:

  • Previous exposure to Kubernetes operators.
  • Experience writing clean, maintainable Terraform for declarative infrastructure management.
  • Prior knowledge and experience in on-premises bare metal environments.
  • Familiarity with configuration management tools such as Puppet, Chef, or Ansible.
  • Experience with Argo CD and Argo Workflows for workflow automation.
  • Working knowledge of monitoring and observability tools such as Prometheus and/or the OpenTelemetry (OTel) ecosystem.
  • Familiarity with Red Hat and CentOS-based Linux distributions.
  • Prior contributions to open-source projects.
  • Experience working with large-scale workflow, batch, or HPC-style compute workloads, including scheduling and execution reliability.

What we offer:

  • 21 days annual leave
  • The opportunity to learn from experienced professionals, with mentorship and room for personal growth
  • Comprehensive insurance package with extended coverage for dependents
  • Well-stocked pantry
  • Annual dental and wellness budget
  • Gym membership
  • Competitive compensation

What you can expect working at Grasshopper:

At Grasshopper, you will be working in a diverse and dynamic environment with a flat hierarchy. With over 100 employees and 15 nationalities working in an open office, communication is essential to performance. To keep our edge as the “small giant” of trading technology, we give employees a high level of autonomy and encourage them to get creative, take risks, make mistakes and learn from them. The sprint is on!

Grasshopper is an equal opportunity employer.