Job Search and Career Advice Platform

Attiva gli avvisi di lavoro via e-mail!

Senior Deep Learning Software Engineer, Inference

NVIDIA

Italia

In loco

EUR 70.000 - 90.000

Tempo pieno

Ieri
Candidati tra i primi

Genera un CV personalizzato in pochi minuti

Ottieni un colloquio e una retribuzione più elevata. Scopri di più

Descrizione del lavoro

A leading tech company is seeking a Senior Deep Learning Software Engineer specializing in Inference. The role involves optimizing GPU-accelerated software for AI applications and improving high-performance deep learning frameworks. Candidates should have 5+ years of software development experience and a Master's or PhD in a relevant field. This full-time position offers a dynamic environment focused on deep learning advancements. Salary will be competitive and based on location and experience.

Competenze

  • 5+ years of relevant software development experience.
  • Prior experience with training, deploying, or optimizing DL models.
  • Background in performance modeling, profiling, and code optimization.

Mansioni

  • Design, build, and optimize GPU‑accelerated software for AI.
  • Optimize performance of DL models in various domains.
  • Collaborate with teams on inference optimization solutions.

Conoscenze

C/C++ programming
Software design
Agile development
Python

Formazione

Masters or PhD in Computer Engineering, Computer Science, or related field
Descrizione del lavoro
Senior Deep Learning Software Engineer, Inference

Join to apply for the Senior Deep Learning Software Engineer, Inference role at NVIDIA

NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for our growing team. As a key contributor, you will design, build, and optimize GPU‑accelerated software that powers today’s most sophisticated AI applications.

Our team develops and maintains high‑performance deep learning frameworks, including SGLang and vLLM, at the forefront of efficient large‑scale model serving and inference. You will improve these platforms, facilitate deployment and serving of groundbreaking language models, and implement the latest algorithms for public release in frameworks like SGLang, vLLM, and other DL frameworks.

What You’ll Be Doing
  • Performance optimization, analysis, and tuning of DL models in domains like LLM, multimodal, and generative AI.
  • Scale performance of DL models across different NVIDIA accelerators.
  • Contribute features and code to NVIDIA’s inference libraries, vLLM, SGLang, FlashInfer, and LLM software solutions.
  • Collaborate with cross‑framework teams across NVIDIA libraries and inference optimization solutions.
What We Need To See
  • Masters or PhD or equivalent experience in relevant field (Computer Engineering, Computer Science, EECS, AI).
  • 5+ years of relevant software development experience.
  • Excellent C/C++ programming and software design skills. SW Agile skills are helpful and Python experience is a plus.
  • Prior experience with training, deploying or optimizing the inference of DL models in production is a plus.
  • Prior background with performance modeling, profiling, debug, and code optimization or architectural knowledge of CPU and GPU is a plus.
Ways To Stand Out From The Crowd
  • Contribute to Deep Learning Software projects, such as PyTorch, vLLM, and SGLang to drive advancements in the field.
  • Experience with Multi‑GPU Communications (NCCL, NVSHMEM).
  • Experience building and shipping products to enterprise customers.
  • GPU programming experience (CUDA, OAI TRITON or CUTLASS).

NVIDIA is at the forefront of breakthroughs in Artificial Intelligence, High‑Performance Computing, and Visualization. As an equal opportunity employer, we are committed to fostering a supportive and empowering workplace for all.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. For Poland: The base salary range is 213,750 PLN - 370,500 PLN for Level 3, and 281,250 PLN - 487,500 PLN for Level 4.

JR1997930

Seniority level
  • Mid‑Senior level
Employment type
  • Full‑time
Job function
  • Computer Hardware Manufacturing, Software Development, and Computers and Electronics Manufacturing
Ottieni la revisione del curriculum gratis e riservata.
oppure trascina qui un file PDF, DOC, DOCX, ODT o PAGES di non oltre 5 MB.