Enable job alerts via email!

Platform Engineer I - Machine Learning Infrastructure

Spotify

Toronto

Hybrid

CAD 80,000 - 110,000

Full time

23 days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

A leading company is seeking a Machine Learning Infrastructure Engineer to develop and maintain a robust ML platform. The role involves managing Kubernetes clusters, collaborating with teams, and implementing scalable solutions. Flexibility for remote work is offered, with some in-person meetings required.

Qualifications

  • 1+ years of hands-on experience implementing production ML infrastructure.
  • Knowledge of deep learning fundamentals and algorithms.

Responsibilities

  • Manage and maintain large scale production Kubernetes clusters for ML workloads.
  • Collaborate with Machine Learning Engineers and product teams for scalable solutions.

Skills

Python
Go
Agile Software Processes

Tools

Huggingface
Ray
PyTorch
TensorFlow

Job description

The Hendrix ML Platform team is dedicated to developing a robust, Spotify-wide platform for training and serving machine learning models. This platform streamlines the productionization of AI and ML models by mitigating the incidental complexities involved in creating backend services for serving predictions and training models.


What You'll Do
  • Manage and maintain large scale production Kubernetes clusters for ML workloads, including ML platform infrastructure and necessary dev ops.
  • Contribute to Spotify ML Platform SDK and build tools for various ML operations.
  • Collaborate with Machine Learning Engineers (MLE), researchers, and various product teams to deliver scalable ML platform tooling solutions that meet the timelines and specifications of given requirements.
  • Work independently and collaboratively on squad projects that often requires learning and applying new technologies that may go beyond existing skillsets.
  • Designs, documents and implements reliable, testable and maintainable solutions ML infrastructure capabilities.
Who You Are
  • You have 1+ years of hands-on experience implementing production ML infrastructure at scale in Python, Go or similar languages
  • Knowledge of deep learning fundamentals, algorithms, and open-source tools such as Huggingface, Ray, PyTorch or TensorFlow
  • Contributed to a production ML Model or ML infrastructure
  • You have a general understanding of data processing for ML
  • You have experience with agile software processes and modular code design following industry standards
Where You'll Be
  • This role is based in Toronto.
  • We offer you the flexibility to work where you work best! There will be some in person meetings, but still allows for flexibility to work from home.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Engineer I

TD

Toronto

On-site

CAD 76,000 - 116,000

10 days ago

Machine Learning Engineer I

Affirm

London

Remote

CAD 102,000 - 142,000

22 days ago

Engineer I

TD

Toronto

On-site

CAD 76,000 - 116,000

22 days ago

Engineer I (Computer Application Forensic )

TD

Old Toronto

Hybrid

CAD 76,000 - 116,000

30+ days ago

Software Engineer I, Entry Level (Fall 2024-Spring 2025) - Toronto

DoorDash

Toronto

On-site

CAD 60,000 - 100,000

30+ days ago

Senior Security Engineer I

Braze Inc.

Ontario

On-site

CAD 80,000 - 120,000

30+ days ago

New Grad Civil Engineer I - Summer 2025

HNTB

Ontario

On-site

USD 71,000 - 112,000

30+ days ago

New Grad Civil Engineer I: Drainage - Summer 2025

HNTB

Ontario

On-site

USD 71,000 - 112,000

30+ days ago

New Grad Civil Engineer I: Structures - Summer 2025

HNTB

Ontario

On-site

USD 74,000 - 112,000

30+ days ago