Remote Senior ML Systems Engineer — Scalable GPU Infra

Pathway (pathway.com)

Remote

PLN 180,000 - 240,000

Full time

Yesterday

Job summary

A leading AI startup is seeking a Senior ML Systems/ML DevOps Engineer to run the infrastructure behind its machine learning training and inference workloads. The role involves operating and scaling GPU-heavy clusters, designing, building, and automating the ML platform, and solving networking and cost optimization problems across multiple cloud providers. Candidates should have 5+ years of experience in DevOps, SRE, Platform, or Infrastructure roles and a strong background in Linux and cloud infrastructure. The position is remote and open to candidates in the EU and North America.

Benefits

Intellectually stimulating work environment
Inclusive workplace culture
Exciting career prospects

Qualifications

  • 5+ years of experience in DevOps/SRE/Platform/Infrastructure roles running production systems.
  • Deep familiarity with Linux as a daily driver.
  • Strong experience with workload management.

Responsibilities

  • Operate and scale GPU-heavy clusters for training and inference.
  • Design, build, and automate the ML platform.
  • Work across multiple cloud providers solving networking and cost optimization problems.

Skills

Linux administration
Container orchestration
Cloud infrastructure management
Workload management
Shell scripting
Monitoring and logging
Python programming

Tools

Docker
Kubernetes
AWS
GCP
Azure
Terraform