Senior GPU Compiler Software Development Engineer

Luxoft

Fatih

On-site

TRY 150,000 - 300,000

Full time

30+ days ago

Job summary

A leading technology firm is seeking a senior developer to work on GPU support for OpenAI/Triton. You will develop and optimize features, conduct performance analysis, and communicate with developers, customers, and project managers. Candidates should have strong C/C++ skills and experience with compiler internals. The role supports AI workloads and requires collaboration with the open-source community.

Qualifications

  • Strong C/C++ programming skills.
  • Experience with compiler internals (LLVM, GCC, or any other).
  • Basic Python programming skills.
  • Experience in performance analysis.

Responsibilities

  • Develop and optimize new features for the OpenAI/Triton project on GPUs.
  • Communicate with developers, customers, and project managers.
  • Implement tests and verify the system.

Tools

Docker
CMake
HIP
CUDA
OpenCL

Job description

Project description

Working on GPU support for OpenAI/Triton, a language and compiler for writing highly efficient custom deep-learning primitives. Work with the open-source community to analyze, develop, test, and deploy performance improvements for neural networks implemented with Triton on GPUs with ROCm.

Responsibilities

  • Develop, support, and optimize new features of the OpenAI/Triton project for GPUs.
  • Communicate with other developers, customers, and project managers.
  • Implement tests, write project documentation, and verify the system with unit, component, and functional tests.

Skills

Must have

  • Strong C/C++ programming skills
  • Experience with compiler internals (LLVM, GCC, or any other)
  • Basic Python programming skills
  • Experience in performance analysis

Nice to have

  • Basic understanding of ML technologies
  • Experience with GPGPU (general-purpose GPU) computing (HIP, CUDA, OpenCL, etc.)
  • Experience with PyTorch
  • Experience with LLVM and MLIR compiler infrastructure, including implementing analyses or optimizations
  • Knowledge of ROCm infrastructure
  • Experience with CMake and make/ninja build systems
  • GEMM performance fundamentals
  • Experience with Docker
