Join a forward-thinking company as a Machine Learning Compiler Engineer, where you'll be at the forefront of developing cutting-edge compiler technologies for machine learning workloads. This role offers the chance to work on innovative software designs that solve operational challenges and enhance developer efficiency. You'll collaborate with a talented team, contribute to impactful projects, and engage in research that pushes the boundaries of AI capabilities. If you're passionate about compiler development and eager to make a significant impact in the world of machine learning, this is the opportunity for you.
Annapurna Labs builds custom Machine Learning accelerators that are at the forefront of AWS innovation and among the AWS technologies used for building Generative AI on AWS. The Neuron Compiler team is searching for engineers with compiler expertise to support the development and scaling of a compiler that enables the world's largest ML workloads to run performantly on these custom Annapurna systems.
The Product: The AWS Machine Learning accelerators represent a pinnacle of AWS technologies, specifically designed for advancing AI capabilities. The Inferentia and Trainium chips deliver unparalleled ML inference and training performance. They are enabled through a state-of-the-art software stack, the AWS Neuron Software Development Kit (SDK). The SDK comprises an ML compiler, runtime, and application framework, which integrate seamlessly into popular ML frameworks such as PyTorch. AWS Neuron, running on Inferentia and Trainium, is trusted and used by leading customers such as Snap, Autodesk, and Amazon Alexa.
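To give a sense of what that integration looks like in practice, below is a minimal sketch of compiling a PyTorch model through the Neuron SDK's torch-neuronx package. The toy model and file name are illustrative placeholders, and the snippet assumes torch-neuronx is installed on a Neuron-equipped instance (e.g. Inf2 or Trn1).

```python
# Minimal sketch: compiling a PyTorch model with the Neuron SDK's
# torch-neuronx integration. Assumes torch-neuronx is installed and the
# code runs on a Neuron-equipped instance (e.g. Inf2/Trn1).
import torch
import torch_neuronx

# A toy model standing in for a real LLM or vision network.
model = torch.nn.Sequential(
    torch.nn.Linear(128, 256),
    torch.nn.ReLU(),
    torch.nn.Linear(256, 10),
).eval()

example_input = torch.rand(1, 128)

# torch_neuronx.trace invokes the Neuron compiler and returns a module
# whose forward pass executes on the accelerator.
neuron_model = torch_neuronx.trace(model, example_input)

# The compiled model can be saved and reloaded like a TorchScript artifact.
torch.jit.save(neuron_model, "model_neuron.pt")
```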
The Team: Annapurna Labs was a startup company acquired by AWS in 2015 and is now fully integrated. If AWS is an infrastructure company, then think of Annapurna Labs as the infrastructure provider of AWS. Our org covers multiple disciplines including silicon engineering, hardware design and verification, software, and operations. AWS Nitro, ENA, EFA, Graviton and F1 EC2 Instances, AWS Neuron, the Inferentia and Trainium ML accelerators, and scalable NVMe storage are some of the products we have delivered over the last few years.
Within this ecosystem, the Neuron Compiler team is developing a deep learning compiler stack that takes state-of-the-art LLM and vision models created in frameworks such as TensorFlow, PyTorch, and JAX, and makes them run performantly on our accelerators. The team comprises some of the brightest minds in the engineering, research, and product communities, focused on the ambitious goal of creating a toolchain that will provide a quantum leap in performance.
You: As a Machine Learning Compiler Engineer I on the AWS Neuron Compiler team, you will support the ground-up development and scaling of a compiler that handles the world's largest ML workloads. Architecting and implementing business-critical features, publishing cutting-edge research, and contributing to a brilliant team of experienced engineers excites and challenges you. You will leverage your technical communication skills as a hands-on partner to AWS ML services teams, and you will be involved in pre-silicon design, bringing new products and features to market, and many other exciting projects.
A background in compiler development is strongly preferred. A background in Machine Learning and AI accelerators is preferred, but not required.
To be considered for this role, candidates must be currently located in, or willing to relocate to, Cupertino (preferred), Seattle, or Toronto.
Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status.