
Enable job alerts via email!
Generate a tailored resume in minutes
Land an interview and earn more. Learn more
A leading technology firm is seeking a developer to work on GPU support for OpenAI/Triton. You will develop and optimize features, communicate with various stakeholders, and conduct performance analysis. Candidates should have strong C/C++ skills and experience with compiler internals. This role supports growth in the field of AI and requires collaboration with the open-source community.
Project description
Working on GPU support for OpenAI/Triton — a language and compiler for writing highly efficient custom Deep-Learning primitives. Work with the open-source community to analyze, develop, test, and deploy performance improvements for neural networks implemented with Triton on GPUs with ROCm.
Responsibilities
SKILLS
Must have
Nice to have
• Basic understanding of ML technologies• Experience with GPGPU (General purpose GPU) computing (HIP, CUDA, OpenCL, etc.)• Experience with PyTorch• Experience with LLVM and MLIR compiler infrastructure, analysis or optimizations implementation• Knowledge of ROCm infrastructure• Experience in CMake, make/ninja build system• GEMM performance fundamentals• Experience with Docker