Aktiviere Job-Benachrichtigungen per E-Mail!

Senior GPU Compiler Software Development Engineer

Luxoft

Schweiz

Remote

CHF 100’000 - 130’000

Vollzeit

Vor 30+ Tagen

Zusammenfassung

A leading technology consultancy in Switzerland is seeking a developer to work on GPU support for OpenAI/Triton. You will be responsible for developing, supporting, and optimizing the project for GPUs, and communicating with various stakeholders. The ideal candidate has strong C/C++ skills and experience with compiler internals. Join us in enhancing deep learning performance in the open-source community.

Qualifikationen

  • Strong C/C++ programming skills required.
  • Experience with compiler internals such as llvm or gcc is essential.
  • Basic Python programming knowledge is necessary.
  • Experience in performance analysis is crucial.

Aufgaben

  • Develop and optimize new features for OpenAI/Triton project focused on GPUs.
  • Communicate with developers, customers, and project managers.
  • Implement tests, document the project, and verify with unit tests.

Kenntnisse

Strong C/C++ programming skills
Experience with compiler internals (llvm, gcc)
Basic Python programming skills
Experience in performance analysis

Tools

Docker
CMake
Jobbeschreibung

Project description

Working on GPU support for OpenAI/Triton — a language and compiler for writing highly efficient custom Deep-Learning primitives. Work with the open-source community to analyze, develop, test, and deploy performance improvements for neural networks implemented with Triton on GPUs with ROCm.

Responsibilities

  • New features development, support and optimization of OpenAI/Triton project for GPUs. Communication with other developers, customers and project managers. Test implementation, project documentation and verification of system with unit/component/functional tests.

SKILLS

Must have

  • Strong C/C++ programming skills
  • Experience with compiler internals (llvm, gcc or any other)
  • Basic Python programming skills
  • Experience in performance analysis

Nice to have

• Basic understanding of ML technologies• Experience with GPGPU (General purpose GPU) computing (HIP, CUDA, OpenCL, etc.)• Experience with PyTorch• Experience with LLVM and MLIR compiler infrastructure, analysis or optimizations implementation• Knowledge of ROCm infrastructure• Experience in CMake, make/ninja build system• GEMM performance fundamentals• Experience with Docker

Hol dir deinen kostenlosen, vertraulichen Lebenslauf-Check.
eine PDF-, DOC-, DOCX-, ODT- oder PAGES-Datei bis zu 5 MB per Drag & Drop ablegen.