Enable job alerts via email!

Senior GPU Kernel Developer

Luxoft

Fatih

On-site

TRY 150,000 - 300,000

Full time

17 days ago

Job summary

A leading technology firm in Turkey is seeking an experienced developer with GPU compute and performance profiling expertise. You will lead efforts to optimize HIP kernels on AMD GPUs and collaborate with development teams to enhance GPU-accelerated applications. Ideal candidates should have a strong background in C++, CUDA/HIP programming, and a solid understanding of GPU architectures. Opportunities for growth and innovation await in a dynamic team environment.

Qualifications

  • Proficiency with C++ and low-level programming (at least C++ 17).
  • Proficiency in CUDA or HIP/ROCm programming.
  • Solid understanding of GPU architectures and optimization techniques.
  • Strong problem-solving skills in a collaborative environment.
  • Experience in AI/ML/DL/NN/NLP/Computer Vision.

Responsibilities

  • Optimize HIP kernels for specific AMD hardware.
  • Collaborate with teams to enhance GPU-accelerated applications.
  • Debug, profile, and fine-tune code for performance improvements.
  • Stay updated on GPU architectures and programming models.

Skills

C++
CUDA/HIP programming
GPU architectures
Parallel programming
AI/ML/DL experience
Python

Tools

Linux
Profiling tools
gdb/LLDB
Job description
Overview

Luxoft is searching for talented developers with GPU compute and performance profiling experience to join the rapidly growing team. We are seeking an experienced individual proficient in GPGPU applications to join our team. The primary responsibility of this role will be to lead the effort in optimizing HIP kernels on AMD GPUs. The candidate should possess a strong background in GPU computing, parallel programming, and a deep understanding of CUDA or HIP frameworks. Additionally, familiarity with optimization techniques is highly desirable.

Responsibilities
  • The main task will be to help optimize HIP kernels for specific AMD hardware.
  • Collaborate with development teams to optimize and enhance GPU-accelerated applications.
  • Debug, profile, and fine-tune code for performance improvements.
  • Stay updated with the latest advancements in GPU architectures and programming models.
Skills

Must have

  • Proficiency with C++ and low-level programming (at least C++ 17).
  • Proficiency in CUDA or HIP / ROCm programming.
  • Solid understanding of GPU architectures, parallel programming models, and optimization techniques.
  • Strong problem-solving skills and the ability to work in a collaborative environment.
  • One of AI/ML/DL/NN/NLP/Computer Vision experience.
  • Python.

Nice to have

  • Linux
  • CPU Intrinsics (AVX/SSE)
  • GPU Assembler
  • Profiling
  • gdb/LLDB
  • Jinja2 or similar templating engines
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.