A leading technology consultancy in Switzerland is seeking a developer to work on GPU support for OpenAI/Triton. You will be responsible for developing, supporting, and optimizing the project for GPUs, and communicating with various stakeholders. The ideal candidate has strong C/C++ skills and experience with compiler internals. Join us in enhancing deep learning performance in the open-source community.
Project description
Work on GPU support for OpenAI/Triton, a language and compiler for writing highly efficient custom deep-learning primitives. Collaborate with the open-source community to analyze, develop, test, and deploy performance improvements for neural networks implemented with Triton on GPUs with ROCm.
Responsibilities
SKILLS
Must have
Nice to have
• Basic understanding of ML technologies
• Experience with GPGPU (General purpose GPU) computing (HIP, CUDA, OpenCL, etc.)
• Experience with PyTorch
• Experience with LLVM and MLIR compiler infrastructure, analysis or optimizations implementation
• Knowledge of ROCm infrastructure
• Experience in CMake, make/ninja build system
• GEMM performance fundamentals
• Experience with Docker