Ativa os alertas de emprego por e-mail!
Melhora as tuas possibilidades de ir a entrevistas
Cria um currículo adaptado à oferta de emprego para teres uma taxa de sucesso superior.
A leading company seeks a GPU Communications Developer to provide technical leadership in developing communication software solutions for AMD GPUs. The role involves engaging with clients and stakeholders, mentoring engineers, and ensuring satisfaction through tailored solutions. Candidates should have deep expertise in RDMA and distributed programming models, with a proven track record in communication software.
1 week ago Be among the first 25 applicants
Get AI-powered advice on this job and more exclusive features.
Direct message the job poster from Luxoft
Project Description:
The ROCm Communication Collectives Library (RCCL) is a stand-alone library that provides multi-GPU and multi-node collective communication primitives optimized for AMD GPUs. It uses PCIe and xGMI high-speed interconnects.
Responsibilities:
• Provide deep technical leadership and guidance for GPU communication technologies, define the technical vision and direction for the GPU communication software stack.
• Engage with executives and key stakeholders to provide insight into industry trends and recommend strategic initiatives. Influence the future direction of the company's technical portfolio.
• Represent AMD in leadership positions at industry organizations and standards bodies.
• Engage with clients and industry partners to deeply understand technical needs, ensuring their satisfaction with tailored solutions that leverage your experience in strategic customer engagements and architectural wins.
• Collaborate with hardware and software architects, system engineers and business teams in identifying requirements and building roadmaps for future products.
• Mentor engineers and technical leaders, fostering a culture of innovation and excellence. Help develop the next generation of leaders through coaching, training, and feedback.
Mandatory Skills Description:
• Experience architecting and developing communication software solutions for accelerators using RDMA and accelerator-to-accelerator fabrics (eg. Infinity Fabric, UALink), from low-level device drivers and OS internals up through applications and AI/ML frameworks
• Deep expertise with distributed programming models (MPI, SHMEM), and the implementation and optimization of collective communication algorithms
• Deep expertise with RoCE, RDMA, and network topologies
• Experience with system software development in C/C++, and GPU software development and parallel programing
• Analytical and performance analysis skills
• Effective communication and problem-solving skills
• Proven history of communication software thought leadership, backed with patents, publications, and participation in industry standards bodies
Nice-to-Have Skills Description:
Advanced degrees, such as Master's or Ph. D. are preferred
Languages:
Referrals increase your chances of interviewing at Luxoft by 2x
We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.