Senior Software Engineer, AI Systems - vLLM and MLPerf

NVIDIA

Myrtle Point (OR)

On-site

USD 300,000 - 357,000

Full time

Today

Job summary

A leading technology company in Oregon is seeking a highly skilled Software Engineer to design and implement inference systems for cutting-edge AI models. The ideal candidate will have a strong background in Python and C++, a deep understanding of deep learning, and hands-on experience with ML frameworks. This role offers a competitive salary, equity, and comprehensive benefits, along with opportunities for career advancement.

Benefits

Comprehensive benefits package
Opportunity for career advancement
Collaborative work environment

Qualifications

  • 5+ years of experience in software development, preferably with Python and C++.
  • Deep understanding of deep learning algorithms, distributed systems, and high-performance computing principles.
  • Hands-on experience with ML frameworks and inference engines.

Responsibilities

  • Design and implement efficient inference systems for large-scale generative AI models.
  • Define benchmarking methodologies and build tools for industry-wide adoption.
  • Collaborate with researchers and engineers to productionize advanced model architectures.

Skills

Python
C++
Deep learning algorithms
GPU programming
Distributed systems
High-performance computing
ML frameworks
Performance profiling tools

Education

Bachelor's, Master's, or PhD in Computer Science/Engineering

Tools

PyTorch
vLLM
CUDA
NCCL
Docker
AWS
GCP
Azure

Job description
Overview

Employer Industry: Technology - Artificial Intelligence

Why consider this job opportunity
  • Salary range up to $356,500 for Level 5 positions
  • Eligibility for equity and comprehensive benefits package
  • Opportunity for career advancement and growth within the organization
  • Work in a collaborative environment with leading experts in AI and performance optimization
  • Engage in cutting-edge research and contribute to academic publications
  • Chance to work on high-impact software that pushes the boundaries of AI technology

What to Expect (Job Responsibilities)
  • Design and implement efficient inference systems for large-scale generative AI model deployments
  • Define benchmarking methodologies and build tools for industry-wide adoption
  • Develop, profile, debug, and optimize system components for MLPerf Inference benchmarks
  • Collaborate with researchers and engineers to productionize advanced model architectures and inference techniques
  • Participate in design discussions and technical planning to align products with business goals

What is Required (Qualifications)
  • Bachelor’s, Master’s, or PhD degree in Computer Science/Engineering, Software Engineering, or related field, or equivalent experience
  • 5+ years of experience in software development, preferably with Python and C++
  • Deep understanding of deep learning algorithms, distributed systems, and high-performance computing principles
  • Hands-on experience with ML frameworks (e.g., PyTorch) and inference engines (e.g., vLLM)
  • Familiarity with GPU programming, CUDA, NCCL, and performance profiling tools

How to Stand Out (Preferred Qualifications)
  • Background in building and optimizing LLM inference engines like vLLM and SGLang
  • Experience with cloud platforms (e.g., AWS, GCP, or Azure) and containerization tools (e.g., Docker)
  • Exposure to DevOps practices, CI/CD pipelines, and infrastructure as code
  • Contributions to open-source projects, ideally documented with a list of submitted GitHub PRs

We prioritize candidate privacy and champion equal-opportunity employment. Central to our mission is our partnership with companies that share this commitment. We aim to foster a fair, transparent, and secure hiring environment for all. If you encounter any employer not adhering to these principles, please bring it to our attention immediately.

We are not the EOR (Employer of Record) for this position. Our role in this specific opportunity is to connect outstanding candidates with a top-tier employer.
