Solutions Architect - Generative AI

NVIDIA

Santa Clara (CA)

On-site

USD 168,000 - 322,000

Full time

12 days ago

Job summary

NVIDIA is seeking a Solutions Architect for its AI Operations Team to architect and implement large-scale AI projects for its Digital Marketing Organization. The role requires deep knowledge of applied AI trends, exceptional communication skills, and Python expertise. Candidates should be passionate about using NVIDIA's latest generative AI technologies to build impactful solutions in a collaborative environment.

Benefits

Equity
Generous benefits package

Qualifications

  • 8+ years of hands-on experience in a technical role.
  • Experience with generative AI and deploying AI-powered solutions at scale.
  • Strong knowledge of cloud and datacenter GPU systems.

Responsibilities

  • Architect end-to-end generative AI applications focusing on LLM deployment.
  • Collaborate with diverse teams to deliver tailored AI solutions.
  • Implement strategies for efficient AI workflows using NVIDIA's technologies.

Skills

Python programming
Collaboration
Communication
AI applications

Education

Master's or Ph.D. in Computer Science or related field

Tools

CUDA
TensorRT
Docker
Kubernetes

Job description

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by phenomenal technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing and transform industries. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Join the team and make a lasting impact on the world!

We’re looking for a Solutions Architect to join our AI Operations Team to architect, lead, and deliver large-scale AI projects for our Digital Marketing Organization. This position requires a deep knowledge of the latest trends in applied AI, with a strong understanding of Large Language Models (LLMs) and Retrieval-Augmented Generation (RAG). The ideal candidate should have specialized expertise in implementing end-to-end AI workflows and be an excellent communicator, able to work with globally dispersed development, product, and business groups. Come help lead our efforts to use NVIDIA's latest generative AI technologies in production-ready AI features across our websites.

What You Will Be Doing

  • Architect end-to-end generative AI applications for the Digital Marketing Organization with a focus on LLM deployment and RAG workflows.
  • Get hands-on and use advanced Python programming knowledge to make valuable contributions at both the application and infrastructure levels.
  • Provide technical leadership and guidance on standard methodologies for training LLMs and implementing RAG-based solutions.
  • Work with our primary collaborators, NVIDIA’s Marketing Team, to understand their requirements and deliver tailored solutions, and partner with the Digital Marketing Org’s AI Development Team and other development resources to complete projects.
  • Collaborate closely with our globally dispersed development, MLOps, product, engineering, and business teams.
  • Develop strategies for implementing AI workflows and agents efficiently and effectively, achieving optimal performance on NVIDIA’s hardware and software platforms.
  • Lead workshops and design sessions with our Digital Marketing Development Teams to define and refine generative AI solutions focused on LLMs and RAG workflows.
  • Design and implement RAG-based workflows to enhance content generation and information retrieval.
  • Work closely with NVIDIA engineering and product teams to provide feedback and contribute to the evolution of generative AI software.
  • Work closely with the Digital Marketing Org’s Web and Platform Teams to integrate RAG workflows into their applications and systems.

What We Need To See

  • Master's or Ph.D. in Computer Science, Artificial Intelligence, or a related field; or equivalent experience in building and deploying AI-powered solutions at scale.
  • 8+ years of hands-on experience in a technical role, including experience with generative AI.
  • Advanced proficiency in Python programming, with the ability to contribute at both the application and infrastructure levels.
  • Knowledge of building agentic frameworks and multi-agent applications using LangChain, LangGraph, etc.
  • Hands-on experience with or understanding of NVIDIA’s hardware and software technologies (e.g., CUDA, Triton, TensorRT, NeMo, RAPIDS).
  • Proven record of successfully deploying and optimizing LLMs for inference in production environments.
  • In-depth understanding of state-of-the-art language models, such as modern open models (e.g. Llama, Mistral) and proprietary APIs (e.g. ChatGPT, Claude, Gemini).
  • Expertise in training and fine-tuning LLMs using NVIDIA NeMo Framework and other popular frameworks.
  • Strong knowledge of cloud and datacenter GPU systems.
  • Excellent communication and collaboration skills with the ability to articulate complex technical concepts to both technical and non-technical team members.
  • Experience leading workshops, training sessions, and communicating technical solutions to diverse audiences.

Ways To Stand Out From The Crowd

  • Experience deploying LLMs in cloud environments (e.g., AWS, Azure, GCP) and on-premises infrastructure.
  • Experience working with any agentic models/frameworks.
  • Working experience with observability and evaluation tools.
  • Familiarity with containerization technologies (e.g., Docker) and orchestration tools (e.g., ECS, Kubernetes) for scalable and efficient model deployment.
  • Hands-on experience with NVIDIA GPU technologies and GPU cluster management, and the ability to design and implement scalable, efficient workflows for LLM training and inference on GPU clusters.

With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world’s most desirable employers; we have some of the most forward-thinking and hardworking people in the world working for us and, due to unparalleled growth, best-in-class teams are rapidly growing. If you’re creative and autonomous with a real passion for your work, we want to hear from you!

The base salary range is 168,000 USD - 322,000 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

JR1997795

  • Seniority level: Mid-Senior level
  • Employment type: Full-time
  • Industries: Computer Hardware Manufacturing, Software Development, and Computers and Electronics Manufacturing
