Attiva gli avvisi di lavoro via e-mail!

Senior GPU Kernel Developer

Experteer Italy

Italia

Remoto

EUR 50.000 - 90.000

Tempo pieno

3 giorni fa
Candidati tra i primi

Aumenta le tue possibilità di ottenere un colloquio

Crea un curriculum personalizzato per un lavoro specifico per avere più probabilità di riuscita.

Descrizione del lavoro

Join a forward-thinking company as a Senior Developer specializing in GPU computing and performance optimization. This role focuses on leading efforts to optimize HIP kernels on AMD GPUs, requiring expertise in CUDA or HIP frameworks. Work collaboratively with development teams to enhance GPU-accelerated applications and stay updated with the latest advancements in GPU architectures. This is an exciting opportunity to contribute to cutting-edge projects in the automotive industry while working remotely in Italy.

Competenze

  • Proficient in GPGPU applications with a strong background in GPU computing.
  • Experience in optimizing HIP kernels on AMD GPUs.

Mansioni

  • Help optimize HIP kernels for specific AMD hardware.
  • Collaborate with teams to enhance GPU-accelerated applications.

Conoscenze

C++ (at least C++ 17)
CUDA or HIP / ROCm programming
GPU architectures understanding
Parallel programming models
Optimization techniques
Problem-solving skills
AI/ML/DL/NN/NLP/Computer Vision
Python

Strumenti

Linux
Profiling tools
gdb/LLDB
Jinja2

Descrizione del lavoro

Project description

Luxoft is searching for talented developers with GPU compute and performance profiling experience to join the rapidly growing team.

We are seeking an experienced individual proficient in GPGPU applications to join our team. The primary responsibility of this role will be to lead the effort in optimizing HIP kernels on AMD GPUs. The candidate should possess a strong background in GPU computing, parallel programming, and a deep understanding of CUDA or HIP frameworks. Additionally, familiarity with optimization techniques is highly desirable.

Responsibilities
  1. The main task will be to help optimize HIP kernels for specific AMD hardware.
  2. Collaborate with development teams to optimize and enhance GPU-accelerated applications.
  3. Debug, profile, and fine-tune code for performance improvements.
  4. Stay updated with the latest advancements in GPU architectures and programming models.
Skills

Must have

  • Proficiency with C++ and low-level programming (at least C++ 17).
  • Proficiency in CUDA or HIP / ROCm programming.
  • Solid understanding of GPU architectures, parallel programming models, and optimization techniques.
  • Strong problem-solving skills and the ability to work in a collaborative environment.
  • Experience with AI/ML/DL/NN/NLP/Computer Vision.
  • Python.

Nice to have

  • Linux.
  • CPU Intrinsics (AVX/SSE).
  • GPU Assembler.
  • Profiling tools.
  • gdb/LLDB.
  • Jinja2 or similar templating engines.
Other
  • Languages: English - B2 Upper Intermediate.
  • Seniority: Senior.

Location: Remote Italy, Italy

Req. VR-110143

Technologies: C/C++, Automotive Industry

Application deadline: 30/04/2025

Ottieni la revisione del curriculum gratis e riservata.
oppure trascina qui un file PDF, DOC, DOCX, ODT o PAGES di non oltre 5 MB.