Enable job alerts via email!

AIML - Senior Machine Learning Infrastructure Engineer -ML Compute, ML Platform & Technology

Apple

Santa Clara (CA)

On-site

USD 120,000 - 180,000

Full time

7 days ago
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An innovative company is seeking a Senior Machine Learning Infrastructure Engineer to enhance ML workloads and contribute to groundbreaking products. In this role, you will design and deliver solutions for ML compute infrastructure, collaborate with various teams, and influence key features impacting millions. Your expertise in Python and Go, along with knowledge of distributed systems and cloud platforms, will be crucial. Join a dynamic team where your contributions will drive technological advancements and shape the future of machine learning.

Qualifications

  • 4+ years experience in scalable backend systems for ML models.
  • Proficiency in Python or Go with strong knowledge of distributed systems.

Responsibilities

  • Design and deliver innovative solutions for ML compute infrastructure.
  • Collaborate on ML workloads including training, inference, and fine-tuning.

Skills

Python
Go
Distributed Systems
Cloud Platforms
Communication Skills

Education

Bachelor's degree in Computer Science
Advanced degrees in relevant fields

Tools

Kubernetes
Ray
Beam
Flink

Job description

AIML - Senior Machine Learning Infrastructure Engineer - ML Compute, ML Platform & Technology

Location: Santa Clara, California, United States

Department: Machine Learning and AI

Summary

Posted: Feb 14, 2025

Role Number: 200590937

Apple fosters innovation through diversity and shared ideas. Join us to contribute to groundbreaking products and services that impact millions worldwide.

Role Overview

As a Senior Engineer on the ML Compute Team, you will collaborate with various teams to enhance ML workloads such as training, inference, and fine-tuning. Your work will influence key Apple features and reach a broad user base.

Responsibilities
  • Design and deliver innovative solutions for ML compute infrastructure
  • Communicate complex systems effectively
  • Collaborate on ML workloads including training, inference, and fine-tuning
  • Stay informed on industry and company-wide technological trends
  • Write code in Go and Python
  • Conduct code reviews and mentor new team members
Minimum Qualifications
  • Bachelor's degree in Computer Science, Engineering, or related field
  • 4+ years experience in scalable backend systems for ML models
  • Proficiency in Python or Go
  • Strong knowledge of distributed systems, containerization, and cloud platforms
  • Experience with Kubernetes, Ray, Beam, Flink
  • Excellent communication skills
Preferred Qualifications
  • Advanced degrees in relevant fields
  • Experience with accelerators like GPU, TPU, AWS Trainium
  • Familiarity with ML frameworks such as JAX, TensorFlow, PyTorch, TensorRT, vLLM
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.