Data Engineer, Data Infrastructure

Spotify • CA

Python Remote

The Data Infrastructure product area enables Spotify to solve complex and critical data engineering problems by providing the infrastructure and tools for engineers to build and manage planet-scale data pipelines. The teams that operate in this space are a group of versatile engineers that build the foundational data processing and management elements of that infrastructure. The products that we own are used by data practitioners across the company to create some of our most beloved consumer products such as Discover Weekly and Wrapped.

We are looking for an experienced software engineer that shares our common interest in building, maintaining and expanding our data processing technology offering while ensuring their scalability meets Spotify’s ever growing data needs. You’ll help build the tools that empower teams across the company to process data at scale that solves their critical business needs, shaping the developer experience for data engineers and anyone working with data at the company. Due to the swift progress of AI and agents, this emerges as an incredibly captivating field.

What You'll Do

Build large-scale batch and real-time data processing tools on Google Cloud Platform, aiming to improve developer experience and standardize workflows with the goal to eventually move to a monorepo for data.

Collaborate with product owners, engineers and other squads to build features and drive improvements in the data processing ecosystem.

Expand the data processing product and technology offering for Spotify to meet the dynamic and ever changing needs of our customers.

Enable the data practitioner community by improving and evolving our data engineering ecosystem at Spotify through support, best-practices and standards.

Who You Are

You have 2+ years of professional data engineering experience.

You have strong coding skills in a modern programming language and a solid understanding of systems design, data structures and algorithms.

You are passionate about developer experience and building products that help users efficiently solve their use cases

You know how to work with high-volume, heterogeneous data, preferably with distributed systems in the cloud.

You have experience with Python at least one JVM-based data processing framework such as Spark, Flink, Dataflow, Storm, etc.

You employ sound engineering practices such as continuous delivery, defensive programming and automated testing and care about shipping high-quality code.

You are comfortable with change and love working in an environment where you experiment, iterate quickly and take ownership of projects from ideation to production.

Bachelor’s degree or higher in Computer Science or related fields is a plus.

Where You'll Be

This role is based in Toronto.

We offer you the flexibility to work where you work best! There will be some in person meetings, but still allows for flexibility to work from home.

Apply Now