Enable job alerts via email!

Senior AI Infrastructure Engineer

CRM Hike

San Francisco (CA)

Remote

USD 160,000 - 230,000

Full time

30+ days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

A forward-thinking company seeks a Senior AI Infrastructure Engineer to build a cutting-edge, multi-cloud PaaS platform. This role involves utilizing open-source technologies to enhance AI workloads and drive innovation. You'll be a technology thought leader, collaborating with a passionate team to create robust services and tools. The position offers competitive compensation, equity, and benefits, along with the flexibility of remote work. Join this mission-driven organization to make a significant impact in the AI landscape and contribute to groundbreaking advancements.

Benefits

Startup Equity
Health Insurance
Flexible Remote Work
Competitive Compensation

Qualifications

  • 5+ years of software development experience, proficient in backend programming.
  • Experience with cloud microservices architectures and multiple cloud providers.

Responsibilities

  • Perform architecture and research for decentralized AI workloads.
  • Work on the core, open-source Together AI platform.

Skills

Software Development
Communication Skills
Collaboration
Infrastructure as Code
Troubleshooting
Cloud Microservices Architecture
Kubernetes
GPU Programming
AI Workloads

Education

Bachelor's Degree in Computer Science or related field

Tools

Terraform
Ansible
Pytorch
TensorFlow

Job description

As a Senior AI Infrastructure Engineer, you will be responsible for building the next generation, highly available, global, multi-cloud PaaS platform with open-source technologies to enable and accelerate Together AI’s rapid growth.

This system spans many diverse environments (Kubernetes, VMs, bare metal compute, and edge deployments) and provides a cohesive and reliable abstraction for running AI workloads in them. You will get to be a technology thought leader, evangelize new, cutting-edge technologies, and solve complex problems.

To be successful, you’ll need to be deeply technical and possess excellent communication, collaboration, and diplomacy skills. You have experience practicing infrastructure-as-code, including using tools like Terraform and Ansible. You have strong software development fundamentals and skills. In addition, you have strong systems knowledge and troubleshooting abilities.

Requirements

  • 5+ years of professional software development experience and proficiency in at least one backend programming language (Golang desired)
  • Demonstrated experience with high performance or distributed cloud microservices architectures and ideally experience building them in operation at a global scale using multiple cloud providers such as AWS, Azure, or GCP
  • Excellent understanding of low level operating systems concepts including multi-threading, memory management, networking and storage, performance, and scale
  • Pragmatic, methodical, well-organized, detail-oriented, and self-starting
  • Experience with Kubernetes and containerization, VPNs, AI workloads, and blockchain based protocols a plus
  • GPU programming, NCCL, CUDA knowledge a plus
  • Experience with Pytorch or Tensorflow a plus
  • 5+ years experience writing high-performance, well-tested, production quality code

Responsibilities

  • Perform architecture and research work for decentralized AI workloads
  • Work on the core, open-source Together AI platform
  • Create services, tools, and developer documentation
  • Create testing frameworks for robustness and fault-tolerance

About Together AI

Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancement such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers in our journey in building the next generation AI infrastructure.

Compensation

We offer competitive compensation, startup equity, health insurance, and other benefits, as well as flexibility in terms of remote work. The US base salary range for this full-time position is: $160,000 - $230,000 + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge.

Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Senior AI Infrastructure Engineer

T-MOBILE USA, Inc.

Town of Texas

On-site

USD 113,000 - 205,000

3 days ago
Be an early applicant

Senior Software Engineer: Infrastructure

DigitalOcean

San Francisco

Remote

USD 170,000 - 220,000

Yesterday
Be an early applicant

Senior AI Infrastructure Engineer - DGX Cloud

NVIDIA Corporation

Santa Clara

On-site

USD 148,000 - 288,000

6 days ago
Be an early applicant

Senior AI Infrastructure Engineer - DGX Cloud

NVIDIA

Santa Clara

On-site

USD 148,000 - 288,000

11 days ago

Senior AI Infrastructure Engineer - DGX Cloud

NVIDIA

Remote

USD 144,000 - 271,000

21 days ago

High Performance Computing and AI Infrastructure Engineer, Sr

Lockheed Martin

Remote

USD 89,000 - 179,000

2 days ago
Be an early applicant

Senior Software Engineer: Infrastructure

DigitalOcean

Denver

Remote

USD 130,000 - 170,000

Yesterday
Be an early applicant

Senior Software Engineer: Infrastructure

DigitalOcean

Seattle

Remote

USD 130,000 - 170,000

Yesterday
Be an early applicant

Senior Infrastructure Engineer

Upstart

Remote

USD 163,000 - 227,000

Yesterday
Be an early applicant