Enable job alerts via email!

Sr. Machine Learning Infrastructure Engineer, Optimus

Tesla, Inc.

Palo Alto (CA)

On-site

USD 116,000 - 360,000

Full time

30+ days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

Join a forward-thinking company as a Senior Machine Learning Infrastructure Engineer, where you'll develop tools and infrastructure to enhance neural networks for humanoid robots. This exciting role involves building Python training systems, managing datasets, and collaborating with hardware teams to ensure high availability for machine learning tasks. Your contributions will directly impact the deployment of cutting-edge technology in real-world applications, making a significant difference in the evolving landscape of robotics. If you're passionate about machine learning and want to see your work in action, this is the opportunity for you!

Benefits

Aetna PPO and HSA plans
Dental and vision plans
401(k) with employer match
Company paid life insurance
Employee Assistance Program
Sick and vacation time
Back-up childcare resources
Employee discounts
Voluntary benefits
Weight loss programs

Qualifications

  • Practical experience in Python and/or C++ programming.
  • Understanding of modern machine learning and deep learning concepts.

Responsibilities

  • Build and improve Python training infrastructure for faster training.
  • Manage, analyze, and visualize training and test datasets.

Skills

Python Programming
C++ Programming
Machine Learning Concepts
Deep Learning
Data Visualization
GPU Resource Management

Tools

PyTorch
HSA Plans

Job description

Sr. Machine Learning Infrastructure Engineer, Optimus

As a Software Engineer for the Optimus team, you will build the tools and infrastructure to make and measure improvements to neural network architecture, visualize data, assist with exporting and deploying neural networks to the bot, and evaluate experimental results. You will help us automate the entire workflows of training, validation, and production of the Optimus. Most importantly, you will see your work repeatedly shipped to and utilized by thousands of Humanoid Robots in real world applications.

What You’ll Do

  • Build and improve our Python training infrastructure for stable and faster training
  • Build the tooling and infrastructure for reporting and visualizing model metrics and performance
  • Build the pipelines to run and validate our PyTorch models
  • Manage, analyze, and visualize our training and test datasets
  • Coordinate with the team managing the hardware cluster to maintain high availability / jobs throughput for Machine Learning
  • Build and improve tooling to deploy trained neural nets to Tesla hardware

What You’ll Bring

  • Practical experience programming in Python and/or C++
  • Proficient in system-level software, particularly hardware-software interactions and resource utilization
  • Understanding of modern machine learning concepts and state of the art deep learning
  • Experience working with training frameworks, ideally PyTorch
  • Demonstrated experience scaling neural network training jobs across clusters of GPU’s
  • Optional: Previous experience in deep learning deployment
  • Optional: Profiling and optimizing CPU-GPU interactions (pipelining compute/transfers, etc)

Compensation and Benefits

Along with competitive pay, as a full-time Tesla employee, you are eligible for the following benefits at day 1 of hire:

  • Aetna PPO and HSA plans > 2 medical plan options with $0 payroll deduction
  • Family-building, fertility, adoption and surrogacy benefits
  • Dental (including orthodontic coverage) and vision plans, both have options with a $0 paycheck contribution
  • Company Paid (Health Savings Account) HSA Contribution when enrolled in the High Deductible Aetna medical plan with HSA
  • Healthcare and Dependent Care Flexible Spending Accounts (FSA)
  • 401(k) with employer match, Employee Stock Purchase Plans, and other financial benefits
  • Company paid Basic Life, AD&D, short-term and long-term disability insurance
  • Employee Assistance Program
  • Sick and Vacation time (Flex time for salary positions), and Paid Holidays
  • Back-up childcare and parenting support resources
  • Voluntary benefits to include: critical illness, hospital indemnity, accident insurance, theft & legal services, and pet insurance
  • Weight Loss and Tobacco Cessation Programs
  • Tesla Babies program
  • Commuter benefits
  • Employee discounts and perks program

Expected Compensation

$116,000 - $360,000/annual salary + cash and stock awards + benefits

Pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience. The total compensation package for this position may also include other elements dependent on the position offered. Details of participation in these benefit plans will be provided if an employee receives an offer of employment.

Tesla is an Equal Opportunity employer. All qualified applicants will receive consideration for employment without regard to any factor, including veteran status and disability status, protected by applicable federal, state or local laws.

Tesla is also committed to working with and providing reasonable accommodations to individuals with disabilities. Please let your recruiter know if you need an accommodation at any point during the interview process.

Privacy is a top priority for Tesla. We build it into our products and view it as an essential part of our business. To understand more about the data we collect and process as part of your application, please view our Tesla Talent Privacy Notice.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

HPC Engineer, AI Infrastructure

Tesla, Inc.

Palo Alto

On-site

USD 133,000 - 356,000

30+ days ago