Enable job alerts via email!

Solutions Architect, Generative AI

NVIDIA

Santa Clara (CA)

On-site

USD 148,000 - 236,000

Full time

6 days ago
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

Join a forward-thinking company at the forefront of AI technology! As a Solution Architect or Data Scientist, you will leverage your expertise in Deep Learning and Machine Learning to create innovative Generative AI solutions. Collaborate with a dynamic team to tackle complex challenges and optimize enterprise applications. With a focus on cutting-edge technology, you will empower clients to adopt NVIDIA's AI SDKs and APIs, enhancing their operational efficiency. This role offers an exciting opportunity to be part of a team that drives the future of AI in the enterprise sector, making a real impact on how technology transforms industries.

Benefits

Equity
Health Benefits
Flexible Work Hours
Professional Development Opportunities

Qualifications

  • 5+ years experience in Deep Learning and Machine Learning.
  • Strong coding skills in Python, C/C++, Bash, and Linux.
  • Experience with large scale Generative AI applications.

Responsibilities

  • Develop end-to-end Generative AI solutions for enterprise use cases.
  • Build reference architectures for deploying and optimizing workloads.
  • Share expert knowledge through training and engineering contributions.

Skills

Deep Learning
Machine Learning
Python
C/C++
Bash
Linux
Generative AI
Information Retrieval
Model Evaluation
Inference Optimization

Education

Bachelor's Degree in Engineering
Master's Degree in Computer Science
Ph.D. in Data Science

Tools

TensorFlow
PyTorch
Docker
Kubernetes
SLURM

Job description

Do you want to be part of the team that brings Artificial Intelligence (AI) technology to the field? We are looking for a Solution Architect or Data Scientist to join the NVIDIA AI Enterprise (NVAIE) SA Segment team. We specialize on the newest technology and advances in Machine Learning, Deep Learning, Generative AI, and Cloud. The vision of the NVAIE Segment team is to use our deep expertise to guide and enable the successful adoption at data center scale of NVIDIA AI Enterprise Software!

If you are passionate about Generative AI and how it can be applied to solve real-world problems, we should talk. NVIDIA is the world leader in GPU accelerated computing and AI, and is looking for developers like you to design and build enterprise AI solutions using our newest technology. As a member of the NVAIE Segment Solution Architecture team, you will work closely with customers and partners to tackle hard problems in customizing and deploying Generative AI workloads in production at scale.

What you’ll be doing:

  • A huge part of our work involves developing end-to-end Generative AI solutions for enterprise use cases. We help customers adopt NVIDIA AI SDKs and APIs by offering deep technical expertise and designing GPU-accelerated pipelines that optimize compute resource utilization and improve workload performance.

  • We solve customer problems by building solutions using Machine Learning and Deep Learning technology including language and multimodal models, information retrieval, domain customization, reasoning, inferencing, agentic systems, and other sophisticated Generative AI workloads.

  • As we work with customers across multiple industries, we build the reference architectures needed to deploy and optimize workloads at large scale. With this knowledge, we help improve NVIDIA products and build creative solutions to overcome scaling challenges.

  • We contribute to the wider organization and community by sharing our expert knowledge with others. This can vary from product engineering contributions to building and delivering hands-on training.

Above all, you will be part of the team that helps bring NVIDIA technology to life in the Enterprise! We empower you and give you the tools to achieve this with the backing of all of NVIDIA, including other Solution Architects, Product, Engineering and Research teams. You’ll get to be the face and trusted expert advisor that our customers and partners rely on.

What we need to see:

  • Strong foundational expertise, from a BS, MS, or Ph.D. degree in Engineering, Mathematics, Physics, Computer Science, Data Science, or similar (or equivalent experience).

  • 5+ years experience demonstrating an established track record in Deep Learning and Machine Learning; experience with GPUs as well as expertise in using deep learning frameworks such as TensorFlow or PyTorch.

  • Strong coding development and debugging skills. Including experience with Python, C/C++, Bash, and Linux.

  • Real-world development of large scale Gen AI applications, including but not limited to information retrieval, model pre-training and post-training, model and pipeline evaluation, inference optimization, guard-railing, agents, and reasoning systems.

  • Demonstrated experience with cluster orchestration tools including Docker, Kubernetes, and SLURM across cloud service providers and on premise.

  • Demonstrated expertise in optimizing AI training and inference workloads over high-performance networks, including both Ethernet and InfiniBand fabrics.

  • Ability to learn fast and quickly adapt to change.

  • Clear written and oral communications skills with the ability to effectively collaborate with executives and engineering teams.

Ways to stand out from the crowd:

  • Proven expertise and hands-on experience with NVIDIA AI products including NIM, Nemo Retriever, Nemo Microservices, and Nemo Framework.

  • Expertise on NVIDIA Spectrum-X.

  • Experience with NVIDIA Collective Communication Library (NCCL).

  • Extensive engineering and customer experience on projects with multiple collaborators.

  • Show willingness and ability to dig into unfamiliar territories to solve complex problems relying on experience from previous work.

The base salary range is 148,000 USD - 235,750 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Principal Technical Architect

salesforce.com, inc.

San Francisco

Remote

USD 185,000 - 248,000

4 days ago
Be an early applicant

A/AI Mch Learn Engnr Prin, Strategic Solutions Architect

Lockheed Martin

Shelton

Remote

USD 177,000 - 354,000

7 days ago
Be an early applicant

A/AI Mch Learn Engnr Prin, Strategic Solutions Architect

Lockheed Martin

Shelton

Remote

USD 177,000 - 354,000

7 days ago
Be an early applicant

AI Security Solutions Architect Santa Clara, California, United States, Remote

Palo Alto Networks, Inc.

Santa Clara

Remote

USD 120,000 - 180,000

30+ days ago

AI/Cloud Solutions Architect (GCP)

DataDirect Networks

Remote

USD 120,000 - 180,000

7 days ago
Be an early applicant

Data/ ML Solutions Architect

Provectus

Mobile

Remote

USD 150,000 - 180,000

-1 days ago
Be an early applicant

Senior Software Engineer, Billing & Expansion Team - US (Remote)

Weights & Biases

San Francisco

Remote

USD 177,000 - 245,000

Yesterday
Be an early applicant

Senior Software Engineer, Identity Team (Remote)

Weights & Biases

San Francisco

Remote

USD 177,000 - 245,000

Yesterday
Be an early applicant

Senior Back End Engineer, Platform San Francisco (Remote)

You.ai

San Francisco

Remote

USD 150,000 - 270,000

2 days ago
Be an early applicant