Enable job alerts via email!

Key Customers Solutions Architect (Cloud & AI Infra)

ZipRecruiter

San Francisco (CA)

Remote

USD 180,000 - 300,000

Full time

22 days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An established industry player is seeking a Key Customers Solution Architect to drive AI innovation. This role offers the opportunity to work with cutting-edge AI and cloud technologies, including the latest NVIDIA GPUs. You will be the primary technical point of contact for clients, helping them optimize their GPU performance for machine learning workloads. Your expertise will not only enhance customer satisfaction but also contribute to the development of sustainable AI infrastructure. Join a dynamic team that values creativity and collaboration while making significant impacts in the AI landscape.

Benefits

100% company-paid medical, dental, and vision coverage
401(k) plan with 4% match
Stock options
Flexible remote work environment
20 weeks paid parental leave
Company-paid insurance coverage
Up to $85/month for mobile and internet
Work with cutting-edge AI technologies
Be part of a powerful supercomputer team
Contribute to sustainable AI infrastructure

Qualifications

  • 5+ years in cloud services and AI/ML workloads.
  • Experience with GPU computing and deep learning frameworks.
  • Strong project management and customer-centric skills.

Responsibilities

  • Serve as technical point of contact for AI/ML challenges.
  • Guide customers in optimizing GPU performance.
  • Collaborate with sales to identify new opportunities.

Skills

Cloud Solutions Architect
Technical Account Manager
Customer Engineer
Infrastructure as Code (IaC)
Kubernetes
Python programming
GPU computing
HPC/ML orchestration frameworks
Deep learning frameworks
Stakeholder negotiation

Education

Bachelor's degree in relevant field

Tools

Terraform
Ansible
CUDA
OpenCL
PyTorch
TensorFlow
AWS
Azure
Google Cloud

Job description

Job Description

About the Company

Our client is at the forefront of the AI revolution, providing cutting-edge infrastructure that's reshaping the landscape of artificial intelligence. They offer an AI-centric cloud platform that empowers Fortune 500 companies, top-tier innovative startups, and AI researchers to drive breakthroughs in AI. This publicly traded company is committed to building full-stack infrastructure to service the explosive growth of the global AI industry, including large-scale GPU clusters, cloud platforms, tools, and services for developers.

  • Company Type: Publicly traded

  • Product: AI-centric GPU cloud platform & infrastructure for training AI models

  • Candidate Location: Remote anywhere in the US

Their mission is to democratize access to world-class AI infrastructure, enabling organizations of all sizes to turn bold AI ambitions into reality. At the core of their success is a culture that celebrates creativity, embraces challenges, and thrives on collaboration.

The Opportunity

We are seeking a Key Customers Solution Architect to join our client's team. This role offers the chance to work with state-of-the-art AI and cloud technologies, including the latest NVIDIA GPUs, and contribute to sustainable AI infrastructure development.

What You'll Do

  • Serve as the primary technical point of contact, troubleshooting and resolving complex AI/ML challenges.

  • Guide customers in optimizing GPU performance for ML training and inference workloads, and be a single point of technical expertise for users.

  • Partner with the sales team to identify new opportunities and promote the latest products.

  • Act as a bridge to product teams, providing customer feedback and ensuring alignment with customer requirements.

  • Engage with stakeholders, negotiate solutions, and drive alignment to address customer challenges.

What You Bring

  • 5+ years in roles like Cloud Solutions Architect, Technical Account Manager, or Customer Engineer, with hands-on experience in cloud services and AI/ML workloads.

  • Proficiency in Infrastructure as Code (IaC) tools like Terraform and Ansible.

  • Experience with Kubernetes and Python programming.

  • Solid understanding of GPU computing, including ML training, inference workloads, and GPU stacks (e.g., CUDA, OpenCL).

  • Customer-centric approach with a proven ability to build trust and foster long-term relationships.

  • Strong ability to explain technical concepts to technical and non-technical audiences.

  • Hands-on experience with HPC/ML orchestration frameworks (e.g., Slurm, Kubeflow).

  • Experience with deep learning frameworks (e.g., PyTorch, TensorFlow).

  • Familiarity with ML tools from NVIDIA, AWS, Azure, and Google Cloud providers.

  • Strong project management skills, with the ability to prioritize tasks and deliver on deadlines.

  • Proven experience mentoring technical teams and driving team growth.

  • Expertise in stakeholder negotiation to support problem resolution and ensure seamless collaboration.

  • Legal authorization to work in the United States on a full-time basis without sponsorship.

Key Attributes for Success

  • Passion for AI and transformative technologies.

  • A genuine interest in optimizing and scaling ML solutions for high-impact results.

  • Results-driven mindset and problem-solver mentality.

  • Adaptability and ability to thrive in a fast-paced startup environment.

  • Comfortable working with an international team and diverse client base.

  • Communication and collaboration skills, with experience working in cross-functional teams.

Why Join?

  • Competitive compensation ranging from $180,000 to $300,000 per year (negotiable based on experience and location).

  • Comprehensive benefits package, including 100% company-paid medical, dental, and vision coverage for employees and families.

  • 401(k) plan with a 4% match program and stock options plan.

  • Flexible remote work environment.

  • 20 weeks paid parental leave for primary caregivers, 12 weeks for secondary caregivers.

  • Company-paid short-term, long-term, and life insurance coverage.

  • Up to $85/month for mobile and internet.

  • Opportunity to work with cutting-edge AI technologies, including the latest NVIDIA GPUs (H100, L40S, with H200 and Blackwell chips coming soon).

  • Be part of a team operating one of the most powerful commercially available supercomputers.

  • Contribute to sustainable AI infrastructure, with energy-efficient data centers that recover waste heat to warm nearby residential buildings.

Interviewing Process

  • Level 1: Virtual interview with the Talent Acquisition Lead (General fit, Q&A).

  • Level 2: Virtual interview with the Hiring Manager (Skills assessment).

  • Level 3: Interview with the C-level (Final round).

  • Reference and Background Checks: Conducted post-interviews.

  • Offer: Extended to the selected candidate.

We are proud to be an equal opportunity workplace and are committed to equal employment opportunity regardless of marital status, ancestry, physical or mental disability, genetic information, veteran status, sexual orientation, or expression, or any other characteristic protected by applicable federal, state or local law.

Compensation Range: $180K - $300K

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Key Customers Solutions Architect (Cloud & AI Infra)

ZipRecruiter

San Jose

Remote

USD 180,000 - 300,000

13 days ago

Cloud Solutions Architect (Cloud & AI Infra)

ZipRecruiter

San Francisco

Remote

USD 180,000 - 300,000

13 days ago

Principal AI/ML Infra and Ops Engineering

UnitedHealth Group

San Francisco

Remote

USD 106,000 - 195,000

2 days ago
Be an early applicant

Key Customers Solutions Architect (Cloud & AI Infra)

ZipRecruiter

Chicago

Remote

USD 180,000 - 300,000

8 days ago

Key Customers Solutions Architect (Cloud & AI Infra)

ZipRecruiter

Portland

Remote

USD 180,000 - 300,000

9 days ago

Principal AI/ML Infra and Ops Engineering

Optum

San Francisco

Remote

USD 106,000 - 195,000

5 days ago
Be an early applicant

Principal AI/ML Infra and Ops Engineering - 2287523

UnitedHealth Group

San Francisco

Remote

USD 106,000 - 195,000

5 days ago
Be an early applicant

Cloud Solutions Architect (Cloud & AI Infra)

ZipRecruiter

Las Vegas

Remote

USD 180,000 - 300,000

6 days ago
Be an early applicant

Cloud Solutions Architect (Cloud & AI Infra)

ZipRecruiter

Washington

Remote

USD 180,000 - 300,000

6 days ago
Be an early applicant