Staff Software Development Engineer - GPU Communication Libraries, SHMEM/MPI

Advanced Micro Devices, Inc.

Santa Clara, California

Hybrid

USD 80,000 - 140,000

Full time

11 days ago

Job summary

Join a forward-thinking company where your software engineering skills will help shape the future of computing. As part of a talented team, you'll design innovative solutions that enhance the performance of cutting-edge applications and benchmarks. This role offers a unique opportunity to work with the latest technology in a collaborative environment, driving improvements in software and hardware integration. If you're passionate about technology and eager to make a significant impact, this position is perfect for you. Embrace the challenge and be part of a team that values creativity and excellence.

Benefits

Health insurance
Retirement plans
Flexible working hours
Remote work options
Professional development opportunities

Qualifications

  • Strong background in software engineering with leadership skills.
  • Experience in GPU software development and communication middleware.
  • Ability to enhance maintainability and operational efficiency.

Responsibilities

  • Design software modules in C++, Python, and HIP for GPU systems.
  • Improve existing codebases for better maintainability.
  • Collaborate with architecture specialists to enhance products.

Skills

C++ programming
Python programming
HIP
CUDA
OpenCL
Agile software development
Software performance evaluations
Debugging

Education

Bachelor's or Master's degree in Computer Science, Computer Engineering, Electrical Engineering, or equivalent

Tools

MPI
SHMEM
UCX
libfabric
RDMA APIs

Job description

WHAT YOU DO AT AMD CHANGES EVERYTHING

We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world's most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives.


AMD together we advance_

THE ROLE:

AMD is looking for an influential software engineer who is passionate about improving the performance of key applications and benchmarks. You will be a member of a core team of incredibly talented industry specialists and will work with the very latest hardware and software technology.

THE PERSON:

The ideal candidate is passionate about software engineering and has the leadership skills to drive sophisticated issues to resolution. They communicate effectively and work well with different teams across AMD.

KEY RESPONSIBILITIES:

  • Design software modules in C++, Python, HIP, and assembly to enable collective communication software for datacenter GPU systems
  • Understand existing codebases and software designs, and make improvements to enhance maintainability and operational efficiency
  • Work with AMD's architecture specialists to improve future products and plan software support strategies
  • Aid management in planning and delivering industry-leading software
  • Stay informed of software and hardware trends and innovations, especially pertaining to software algorithms and hardware architecture
  • Design and develop new groundbreaking AMD technologies
  • Participate in new ASIC and hardware bring-ups

PREFERRED EXPERIENCE:

  • Experience with agile software development practices
  • Demonstrated capacity to technically lead developers of varying levels
  • Proficient in C/C++ & Python programming employing best software design practices
  • GPU software development involving HIP, CUDA, or OpenCL
  • Experience with at least one of the following:
    • Implementing communication middleware like MPI/SHMEM
    • Implementing lower-level communication frameworks like UCX and libfabric, or development using RDMA APIs
    • Development and optimization of communication collective algorithms (e.g. AllReduce)
  • Experience in software performance evaluations, optimizations and debugging
  • Ability to closely interact with software technical leads, program managers, and interface with hardware teams

ACADEMIC CREDENTIALS:

Bachelor's or Master's degree in Computer Science, Computer Engineering, Electrical Engineering, or equivalent

LOCATION: San Jose, California OR Remote

Benefits offered are described in "AMD benefits at a glance."

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.
