Enable job alerts via email!

Senior AI/ML Specialist Solutions Architect (Cloud & AI Infra)

ZipRecruiter

Washington (District of Columbia)

Remote

USD 180,000 - 300,000

Full time

21 days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An established industry player is on the lookout for a Senior AI/ML Specialist Solutions Architect to design and implement scalable AI solutions. This exciting opportunity involves working with cutting-edge technologies and contributing to one of the most powerful commercially available supercomputers. You'll be at the forefront of AI innovation, helping clients maximize performance and business value while building long-term relationships. If you have a passion for AI and cloud technologies, this role is perfect for you, offering a flexible remote work environment and competitive compensation.

Benefits

100% company-paid medical, dental, and vision coverage
401(k) plan with a 4% match
Stock options plan
Flexible remote work environment
Company-paid short-term, long-term, and life insurance
20 weeks paid parental leave for primary caregivers
Up to $85/month for mobile and internet
Work with state-of-the-art AI and cloud technologies
Contribute to sustainable AI infrastructure

Qualifications

  • 5+ years of experience in cloud technologies and infrastructure.
  • Proven expertise in scaling AI workloads across multi-node environments.

Responsibilities

  • Architect and optimize distributed training for large-scale AI models.
  • Lead the transition of ML pipelines from POC to production systems.

Skills

Cloud Technologies
AI Workloads Optimization
Machine Learning Frameworks
Exceptional Communication Skills
Python
Go
Java
C++

Education

Bachelor's Degree in Computer Science or related field

Tools

Terraform
Ansible
Kubernetes
Slurm
Git
Docker
Helm
Spark
Kafka
Hadoop

Job description

Job Description

About the Company

Our client is a publicly traded company at the forefront of the AI revolution, offering an AI-centric cloud platform that's reshaping the landscape of artificial intelligence. The company provides cutting-edge infrastructure, including large-scale GPU clusters, cloud platforms, tools, and services for developers to service the explosive growth of the global AI industry for Fortune 1000 companies, top-tier innovative startups, and AI researchers.

  • Company type: Publicly traded

  • Industry: AI/ML, Cloud Computing, Infrastructure-as-Code

  • Candidate Location: Remote U.S.

Their mission is to democratize access to AI infrastructure and empower organizations to create, optimize, and deploy AI solutions at any scale. They aim to simplify the complexities of AI development by providing a full-stack AI platform that combines powerful hardware with user-friendly tools and services.

The Opportunity

We are seeking a Senior AI/ML Specialist Solutions Architect to join our client's team. This role offers the chance to design and implement scalable AI solutions for AI-focused customers, working with state-of-the-art technologies and contributing to one of the most powerful commercially available supercomputers.

What You'll Do
  • Architect and optimize distributed training and inference systems for large-scale AI models

  • Design and deliver customer-focused solutions that maximize performance and business value

  • Lead the transition of ML pipelines from POC to scalable production systems

  • Build long-term customer relationships, ensuring satisfaction and alignment with strategic goals

  • Create whitepapers, deliver technical presentations, and host webinars to share insights and best practices

  • Provide technical leadership and mentor teams on AI infrastructure and deployment strategies

  • Collaborate with engineering and product teams to prioritize customer feedback and influence product roadmaps

What You Bring
  • 5+ years of experience with cloud technologies and infrastructure, ideally in senior MLOps or Solutions Architect roles

  • Proven expertise in scaling and optimizing AI workloads across multi-node and multi-GPU environments

  • Demonstrated success delivering ML products, scaling from POC to production

  • Deep knowledge of ML frameworks like PyTorch and JAX

  • Strong background in the NVIDIA HPC ecosystem (CUDA, NCCL, Infiniband)

  • Exceptional communication skills to engage both technical teams and business stakeholders

  • Legal authorization to work in the United States on a full-time basis without sponsorship

Technical Skills
  • Programming : Python, Go, Java, C++

  • Infrastructure as Code (IaC): Terraform, Ansible

  • Orchestration: Kubernetes (K8s), Slurm

  • DevOps Tools: Git, Docker, Helm

  • Big Data Frameworks: Spark, Kafka, Hadoop

  • Databases: SQL, NoSQL, and vector databases

  • ML Frameworks: PyTorch, TensorFlow, JAX, HuggingFace, Scikit-learn

Why Join?
  • Competitive compensation: $180,000 - $300,000 per year (negotiable based on experience and location)

  • Full medical benefits: 100% company-paid medical, dental, and vision coverage for employees and families

  • 401(k) plan with a 4% match program

  • Stock options plan

  • Flexible remote work environment

  • Company-paid short-term, long-term, and life insurance coverage

  • 20 weeks paid parental leave for primary caregivers, 12 weeks for secondary caregivers

  • Up to $85/month for mobile and internet

  • Work with state-of-the-art AI and cloud technologies, including the latest NVIDIA GPUs

  • Be part of a team that operates one of the most powerful commercially available supercomputers

  • Contribute to sustainable AI infrastructure, with energy-efficient data centers that recover waste heat to warm nearby residential buildings

Interviewing Process
  • Level 1 - Interview with Talent Acquisition

  • Level 2 - Interview with the Hiring Manager

  • Level 3 - Technical Assessment

  • Reference and Background Checks: conducted after successful interviews

  • Job Offer: provided to the selected candidate

We are proud to be an equal opportunity workplace and are committed to equal employment opportunity regardless of marital status, ancestry, physical or mental disability, genetic information, veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by applicable federal, state or local law.

Compensation Range: $180K - $300K

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Senior AI/ML Specialist Solutions Architect (Cloud & AI Infra)

ZipRecruiter

Dallas

Remote

USD 180,000 - 300,000

8 days ago

Senior AI/ML Specialist Solutions Architect (Cloud & AI Infra)

ZipRecruiter

Denver

Remote

USD 180,000 - 300,000

9 days ago

Senior AI/ML Specialist Solutions Architect (Cloud & AI Infra)

ZipRecruiter

Austin

Remote

USD 180,000 - 300,000

21 days ago

Senior AI/ML Specialist Solutions Architect (Cloud & AI Infra)

ZipRecruiter

Washington

Remote

USD 180,000 - 300,000

21 days ago

Senior AI/ML Specialist Solutions Architect (Cloud & AI Infra)

ZipRecruiter

Chicago

Remote

USD 180,000 - 300,000

21 days ago

Senior AI/ML Specialist Solutions Architect (Cloud & AI Infra)

ZipRecruiter

New York

Remote

USD 180,000 - 300,000

21 days ago

Senior AI/ML Specialist Solutions Architect (Cloud & AI Infra)

TieTalent

Town of Texas

Remote

USD 180,000 - 300,000

30+ days ago