This team owns the data and data platform to support all Perception model training and validation, as well as system validation beyond Perception. The platform supports sample curation, data mining, data labeling, and ML dataset management. High-quality data and efficient data flywheel are central to ML model development, enabling teams to efficiently iterate and improve model performance at scale while Zoox expands its business domain.
In this role, you will improve and scale the platform, while also building new features and expanding cross-pillar adoption through Python/C++ tooling, APIs, and data engineering. Your work spans backend API development, data pipelines, and integration with model training and testing workflows, ultimately providing the infrastructure that powers data flywheel at scale.
In this role, you will:
Work on the Data Flywheel that supports multiple Perception ML teams with data mining, labeling, and curation, to help ML teams improve model performance via better data.Quickly ramp up on our team’s current stack and systems, deliver solutions, and improve solutions based on other teams’ feedbackInnovate solutions that leverage AI to drive process automation and efficiencyHelp coach more junior engineers by reviewing their design and implementation
Qualifications
10 YOE+Expert-level proficiency in Python; experienced with or feeling comfortable ramping up C++Strong background in high-performance computing, data processing, and software architecture design and implementationExperience with data query and analysis on a large data warehouse, such as DatabricksExperience in working with AND supporting multiple teams (supporting ML teams is preferred), leading other team members
Bonus Qualifications
Experience in ML and data-driven mode developmentKnowledge of frontend stack (JavaScript, Vue, or React.js)Experience in tooling/infra that supports multiple teams