Job Search and Career Advice Platform

Ativa os alertas de emprego por e-mail!

Senior GPU Kernel Developer

Luxoft

São Paulo

Presencial

BRL 120.000 - 160.000

Tempo integral

Hoje
Torna-te num dos primeiros candidatos

Cria um currículo personalizado em poucos minutos

Consegue uma entrevista e ganha mais. Sabe mais

Resumo da oferta

A leading technology firm in São Paulo is looking for experienced developers proficient in GPU computing and performance profiling. The role focuses on optimizing HIP kernels on AMD GPUs, requiring strong skills in C++, CUDA or HIP programming. Candidates should be problem solvers, work collaboratively, and have an understanding of AI/ML techniques. The position offers opportunities to work on cutting-edge GPU architectures.

Qualificações

  • Proficiency with C++ and low-level programming (at least C++ 17).
  • Proficiency in CUDA or HIP / ROCm programming.
  • Strong understanding of GPU architectures, parallel programming models, and optimization techniques.

Responsabilidades

  • Help optimize HIP kernels for specific AMD hardware.
  • Collaborate with development teams to enhance GPU-accelerated applications.
  • Debug, profile, and fine-tune code for performance improvements.

Conhecimentos

C++
CUDA or HIP programming
GPU architectures
Parallel programming models
Python
AI/ML/DL/NN/NLP/Computer Vision

Ferramentas

Linux
gdb/LLDB
Descrição da oferta de emprego
Project description

Luxoft is searching for talented developers with GPU compute and performance profiling experience to join the rapidly growing team. We are seeking an experienced individual proficient in GPGPU applications to join our team. The primary responsibility of this role will be to lead the effort in optimizing HIP kernels on AMD GPUs. The candidate should possess a strong background in GPU computing, parallel programming, and a deep understanding of CUDA or HIP frameworks. Additionally, familiarity with optimization techniques is highly desirable.

Responsibilities
  • The main task will be to help optimize HIP kernels for specific AMD hardware
  • Collaborate with development teams to optimize and enhance GPU-accelerated applications
  • Debug, profile, and fine-tune code for performance improvements
  • Stay updated with the latest advancements in GPU architectures and programming models
SKILLS
Must have
  • Proficiency with C++ and low-level programming (at least C++ 17)
  • Proficiency in CUDA or HIP / ROCm programming
  • Solid understanding of GPU architectures, parallel programming models, and optimization techniquesStrong problem-solving skills and the ability to work in a collaborative environment
  • One of AI/ML/DL/NN/NLP/Computer Vision experience
  • Python
Nice to have
  • Linux
  • CPU Intrinsics (AVX/SSE)
  • GPU Assembler
  • Profiling
  • gdb/LLDB
  • Jinja2 or similar templating engines
Obtém a tua avaliação gratuita e confidencial do currículo.
ou arrasta um ficheiro em formato PDF, DOC, DOCX, ODT ou PAGES até 5 MB.