AI Runtime Engineer

Advanced Micro Devices, Inc.

California, San Jose (MO, CA)

Hybrid

USD 90,000 - 150,000

Full time

7 days ago

Job summary

An innovative firm is seeking a talented developer to join their team, focusing on GPU runtime implementations and enhancing machine learning capabilities. This role offers the opportunity to tackle complex technical challenges while collaborating with industry experts. Ideal candidates will have a solid understanding of GPU architectures, programming skills in C/C++, and a passion for open-source development. Join a forward-thinking company that values creativity and teamwork, and contribute to cutting-edge projects that shape the future of computing.

Benefits

Employee Stock Purchase Plan

Annual bonus eligibility

Competitive benefits

Flexible working hours

Qualifications

Familiarity with GPU runtime APIs and architectures is essential.
Strong C/C++ skills required for efficient code development.
Experience in parallel and asynchronous programming is preferred.

Responsibilities

Design and maintain GPU runtime implementations in IREE.
Analyze model performance and propose improvements.
Develop multi-GPU communication solutions.

Skills

GPU runtime APIs

GPU drivers

GPU architectures

Parallel programming

Asynchronous programming

Resource management

Quantitative analysis

C/C++ programming

Education

BS in Computer Science

MS in Computer Engineering

Electrical Engineering

Tools

HIP

CUDA

Vulkan

DirectX

Metal

IREE

MLIR

LLVM

SPIR-V

WHAT YOU DO AT AMD CHANGES EVERYTHING

We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world's most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives.

AMD together we advance_

The Role

We are building IREE as an open-source compiler and runtime solution to productionize ML across a variety of usage scenarios and hardware targets. Among them, wide and performant GPU support is critical. We aim for broad GPU coverage, from mobile to datacenter, via a unified software stack. This requires us to write highly efficient code that interacts with the OS and device drivers with minimal dependencies and a small binary size. There will be no shortage of intriguing technical challenges to tackle, and there are abundant chances to collaborate with industry experts working at different layers of the stack. If this sounds interesting to you, please don't hesitate to reach out to us!

The Person

An ideal candidate should be familiar with GPU runtime APIs, GPU drivers, GPU architectures, operating systems, parallel/asynchronous programming, and efficient resource management. They should be comfortable performing quantitative analysis of workloads and driving improvements at the appropriate layers of the software stack. Most importantly, the candidate should be willing to learn and work across boundaries.

Key Responsibilities:

Design, develop, and maintain GPU-related runtime implementations in IREE over HIP, CUDA, Vulkan, DirectX, and Metal
Design, develop, and maintain multi-GPU runtime and communication solutions, including collectives
Manage testing and releasing of runtime components
Quantitatively analyze end-to-end model performance, identify bottlenecks, propose improvements, and prototype and productionize solutions
Design and implement compiler passes to better schedule and utilize resources
Design and implement Python interactions with runtime components
Drive towards general solutions that benefit all GPU targets and the overall community

Preferred Experience:

Experience with GPU APIs (HIP, CUDA, Vulkan, DirectX, Metal)
Understanding of GPU architectures
Understanding of parallel/asynchronous programming
Familiarity with operating system internals and resource management
Understanding of game engine internals
Experience with various system debugging/benchmarking/profiling tools
Strong C/C++ understanding and skills
Familiarity with IREE, MLIR, LLVM, SPIR-V or other compiler technologies
Open-source development ethos

Preferred Academic Credentials:

BS/MS (Computer Science, Computer Engineering, Electrical Engineering, or related equivalent)

Location:
San Jose CA / Seattle WA / Toronto ON

At AMD, your base pay is one part of your total rewards package. Your base pay will depend on where your skills, qualifications, experience, and location fit into the hiring range for the position. You may be eligible for incentives based upon your role such as either an annual bonus or sales incentive. Many AMD employees have the opportunity to own shares of AMD stock, as well as a discount when purchasing AMD stock if voluntarily participating in AMD's Employee Stock Purchase Plan. You'll also be eligible for competitive benefits described in more detail here.

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.
