Enable job alerts via email!

Staff AI/ML Compiler Development Engineer

Advanced Micro Devices, Inc.

California, San Jose (MO, CA)

Hybrid

USD 90,000 - 150,000

Full time

6 days ago
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

Join a forward-thinking company at the forefront of AI technology! As a key member of the AI team, you will architect and define innovative models for Neural Processing Units, tackling complex challenges in AI model development. This role offers the chance to work collaboratively with cross-functional teams, optimize AI software components, and push the boundaries of what's possible in AI. If you're passionate about shaping the future of AI and have a strong background in deep learning, this opportunity is perfect for you. Embrace the challenge and make a significant impact in a dynamic environment!

Qualifications

  • Strong engineering skills for AI model development.
  • Experience in optimizing CNN and Generative AI models.

Responsibilities

  • Research and implement efficient CNN and Generative AI models.
  • Develop optimization techniques including quantization and sparsity.

Skills

AI model development
CNN optimization
Generative AI models
ML compilers
Cross-team collaboration
Quantization techniques
C/C++ programming
Python programming

Education

BSc in Computer Science
MSc in Computer Science

Tools

PyTorch
TensorFlow
ONNX

Job description

WHAT YOU DO AT AMD CHANGES EVERYTHING

We care deeply about transforming lives with AMD technology to enrich our industry, communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences—building blocks for data centers, AI, PCs, gaming, and embedded systems. Our culture pushes the limits of innovation to solve important global challenges. We value execution excellence and foster a culture of being direct, humble, collaborative, and inclusive of diverse perspectives.

AMD together we advance

THE ROLE:

We seek a dynamic, energetic individual to join our growing AI team. This role involves architecting and defining AI workload models, data flow, and performance metrics for Neural Processing Units (NPUs), including network performance modeling and bottleneck analysis on pre/post silicon platforms. As part of our team, you will have the opportunity to shape the future of AI model development.

THE PERSON:
  • Strong engineering skills to address complex AI model development challenges.
  • Experience in optimizing and accelerating CNN and Generative AI models, with excellent cross-team collaboration skills.
  • Proficiency in developing ML compilers for efficient network mapping on NPU.
  • Ability to work with cross-functional teams to optimize AI software stack components such as compilers, frameworks, device drivers, and firmware.
  • Experience with emerging ML models based on CNN and transformers, including performance characterization.
  • Knowledge of quantization, sparsity, architecture search methods to optimize Generative AI models.
  • Ability to collaborate with software engineers, data scientists, and researchers to integrate AI models into applications.
KEY RESPONSIBILITIES:
  • Research, design, and implement methods for efficient CNN and Generative AI models.
  • Develop model optimization techniques including quantization, sparsity, and NAS.
  • Collaborate with team members and other teams, including the compiler team, to develop optimization strategies.
PREFERRED EXPERIENCE:
  • Experience with deep learning frameworks such as PyTorch, ONNX, or TensorFlow.
  • Experience in model compression, quantization, and inference optimization.
  • Strong coding skills in C/C++ and Python.
  • Experience with LLMs, stable diffusion, NeRF, or text-to-video generation is a plus.
  • Solid understanding of AI/ML concepts and practical experience applying them.
  • Knowledge of AI acceleration hardware/software performance implications.
  • Experience developing and optimizing code for VLIW processors, analyzing high-performance operators, and understanding AI frameworks like ONNX.
ACADEMIC CREDENTIALS:
  • BSc or MSc with relevant industry experience.
LOCATION:

San Jose, CA

#LI-JT1 #LI-HYBRID

At AMD, your base pay is part of your total rewards. Compensation depends on skills, experience, and location. You may be eligible for incentives such as bonuses or stock options, including participation in AMD's Employee Stock Purchase Plan. We offer competitive benefits, detailed here.

AMD is an equal opportunity employer. We consider all applicants without regard to legally protected characteristics and are committed to accommodating applicants' needs throughout the recruitment process.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Software Development Engineer, Adobe FireFly

Adobe Inc.

California

Hybrid

USD 113,000 - 207,000

5 days ago
Be an early applicant

R&D Engineer II - Deep Learning Physics

ANSYS, Inc.

Canonsburg

On-site

USD 80,000 - 140,000

30+ days ago