Job Search and Career Advice Platform

Attiva gli avvisi di lavoro via e-mail!

Gpu Compiler Software Development Engineer (Gpu, Cuda - Must)

Luxoft Italy

Milano

In loco

EUR 40.000 - 60.000

Tempo pieno

Oggi
Candidati tra i primi

Genera un CV personalizzato in pochi minuti

Ottieni un colloquio e una retribuzione più elevata. Scopri di più

Descrizione del lavoro

A leading technology solutions provider in Italy is seeking a developer to enhance GPU support for OpenAI / Triton. The ideal candidate has strong programming skills in C/C++, experience with compiler internals, and knowledge of performance analysis. You will be involved in developing new features, optimizing existing code, and documenting your work in a friendly and dynamic environment. This position offers flexible working hours and access to various training programs.

Servizi

Flexible working hours
Access to training center
Continuous learning programs
Employee discounts
Dynamic projects
Knowledge-sharing communities
Team-building events

Competenze

  • Strong programming skills in C/C++.
  • Experience with compiler tools and frameworks like LLVM or GCC.
  • Familiarity with performance analysis techniques.

Mansioni

  • Develop and optimize features for OpenAI/Triton on GPUs.
  • Communicate effectively with team members and stakeholders.
  • Implement systematic tests and maintain project documentation.

Conoscenze

Strong C / C++ programming skills
Experience with compiler internals (llvm, gcc)
Basic Python programming skills
Experience in performance analysis
Basic understanding of ML technologies
Experience with GPGPU computing (HIP, CUDA, OpenCL)
Experience with PyTorch
Experience with LLVM and MLIR compiler infrastructure
Knowledge of ROCm infrastructure
Experience in CMake, make / ninja build system
GEMM performance fundamentals
Experience with Docker
Descrizione del lavoro
Project description

Working on GPU support for OpenAI / Triton — a language and compiler for writing highly efficient custom Deep-Learning primitives. Work with the open-source community to analyze, develop, test, and deploy performance improvements for neural networks implemented with Triton on GPUs with ROCm.

Responsibilities

New features development, support and optimization of OpenAI / Triton project for GPUs. Communication with other developers, customers and project managers. Test implementation, project documentation and verification of system with unit / component / functional tests.

Skills
Must have
  • Strong C / C++ programming skills
  • Experience with compiler internals (llvm, gcc or any other)
  • Basic Python programming skills
  • Experience in performance analysis
Nice to have
  • Basic understanding of ML technologies
  • Experience with GPGPU (General purpose GPU) computing (HIP, CUDA, OpenCL, etc.)
  • Experience with PyTorch
  • Experience with LLVM and MLIR compiler infrastructure, analysis or optimizations implementation
  • Knowledge of ROCm infrastructure
  • Experience in CMake, make / ninja build system
  • GEMM performance fundamentals
  • Experience with Docker
Languages

English : B2 Upper Intermediate

Benefits
  • Flexible working hours
  • Access to Luxoft Training Center (technical & leadership courses)
  • Continuous learning & development programs
  • Employee discounts and perks
  • Dynamic and fast-paced projects
  • Knowledge-sharing communities
  • Brainstorming sessions and idea-sharing meetings
  • Friendly company culture
  • Team-building events & celebrations
Ottieni la revisione del curriculum gratis e riservata.
oppure trascina qui un file PDF, DOC, DOCX, ODT o PAGES di non oltre 5 MB.