Senior Software Infrastructure Engineer
Phizenix • Santa Clara, CA
This role is part of the Software Infrastructure team, responsible for building and scaling the core development infrastructure that supports the entire software engineering organization. You will work on designing, improving, and maintaining tools and platforms that enable development of ML accelerator systems across both hardware and software environments.
In this role, you’ll own and enhance infrastructure around GitLab, CI/CD pipelines, containerized workflows (Docker/Podman), and developer productivity tools. You’ll collaborate closely with software and hardware teams to ensure reliable, scalable, and efficient development and deployment pipelines.
The team is looking for someone with strong hands-on experience in C/C++, Python, Linux-based systems, REST APIs, containers, and modern DevOps practices, with the ability to debug and improve complex CI and build environments.
What You Bring
Required
- Bachelor’s degree in Computer Science, Engineering, Math, Physics, or a related field
- 7+ years of industry experience in software infrastructure, DevOps, or platform engineering
- Strong proficiency in C/C++ and Python
- Deep experience with GitLab workflows, including merge requests, merge trains, and CI/CD pipelines
- Hands-on experience with Docker and Podman containers
- Experience using Kubernetes (K8s) or similar orchestration tools for deploying ML or large-scale workloads
- Familiarity with Bazel, code coverage tooling, data source integration, and DevOps metrics
- Proven ability to debug CI failures, perform root cause analysis, manage linting, bisect issues, handle remote code coverage, and implement security and vulnerability testing
Preferred
- Experience working in startups or small, fast-moving teams
- Background in AI compute, ML accelerators, or subsystem development
- Prior experience in a DevOps-focused role
- Exposure to MLOps, including ML model deployment pipelines and development workflows