Enable job alerts via email!

Lead Machine Learning Infrastructure Engineer

Cantina Labs

San Francisco (CA)

Hybrid

USD 200,000 - 250,000

Full time

30+ days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An innovative firm is seeking a Tech Lead to spearhead the development of cutting-edge machine learning infrastructure for a groundbreaking AI platform. This pivotal role involves architecting robust ML pipelines and ensuring real-time interactions across diverse platforms. With a focus on scalability and efficiency, you'll collaborate with cross-functional teams to integrate AI models and continuously improve them based on user feedback. If you're passionate about AI's potential to transform social interactions and creativity, this is your chance to make a significant impact in a dynamic environment.

Benefits

Health Care Premiums Fully Paid
Monthly Stipend of $500
15 PTO Days per Year
401(K) Participation from Day One
Parental Leave & Fertility Support
Lunch and Snacks Provided
WFH Equipment for Remote Employees

Qualifications

  • 5+ years in machine learning infrastructure for consumer products.
  • 2+ years of management experience preferred.

Responsibilities

  • Lead design and maintenance of scalable ML infrastructure.
  • Optimize deployment of ML models for performance.

Skills

Machine Learning Infrastructure
Team Leadership
Cloud Platforms (AWS)
Container Orchestration (Docker, Kubernetes)
Python Programming
Machine Learning Frameworks (TensorFlow, PyTorch)
Monitoring and Model Management
Communication Skills

Tools

Docker
Kubernetes
AWS
TensorFlow
PyTorch

Job description

A bit about Cantina:

Cantina, founded by Sean Parker, is a new social platform with the most advanced AI character creator. Build, share, and interact with AI bots and your friends directly in the Cantina or across the internet.

Cantina bots are lifelike, social creatures, capable of interacting wherever humans go on the internet. Recreate yourself using powerful AI, imagine someone new, or choose from thousands of existing characters. Bots are a new media type that offer a way for creators to share infinitely scalable and personalized content experiences combined with seamless group chat across voice, video, and text.

If you're excited about the potential AI has to shape human creativity and social interactions, join us in building the future!

A bit about the role:
We are seeking a Tech Lead to guide the development of our machine learning infrastructure team. This role will be critical in scaling our AI systems, which underpin the creation and deployment of highly interactive, multimodal AI characters. You’ll lead the architecture and implementation of robust ML pipelines while managing the infrastructure needed to support real-time interactions across various platforms.

What you’ll do:

  • Lead the design, development, and maintenance of scalable machine learning infrastructure for Cantina’s AI-driven applications.

  • Implement and optimize the deployment of ML models, ensuring low-latency, high-availability performance.

  • Collaborate cross-functionally with product, engineering, and research teams to integrate AI models into our platform.

  • Develop robust monitoring and feedback loops to ensure continuous model improvement based on real-world data and user interactions.

  • Spearhead initiatives to optimize infrastructure for cost, efficiency and scalability.

  • Ensure the machine learning infrastructure meets best practices in security and- reliability.

A bit about you:

  • 5+ years of experience working with machine learning infrastructure in a production environment, preferentially for a consumer facing product.

  • 2+ years of management experience preferred.

  • Proven experience leading teams in building scalable ML systems and pipelines.

  • Expertise with cloud platforms (e.g. AWS) and container orchestration tools (e.g., Docker, Kubernetes).

  • Strong programming skills, with proficiency in Python and experience with ML frameworks such as TensorFlow or PyTorch.

  • Experience with monitoring and managing deployed models, using tools like A/B testing, telemetry, or model performance tracking.

  • Excellent communication skills to work with both technical and non-technical stakeholders.

  • Passion for AI and enthusiasm for its applications in creative and social contexts.

Pay Equity:

In compliance with Pay Transparency Laws, the base salary range for this role is between $200,000-250,000 for those located in the San Francisco Bay Area, New York City and Seattle, WA. When determining compensation, a number of factors will be considered, including skills, experience, job scope, location, and competitive compensation market data.

Benefits Summary:

  • Health Care — 99% of premiums for medical, vision, dental are fully paid for by Cantina, plus One Medical membership.

  • Monthly Stipend — $500/month to use on whatever you’d like!

  • Rest and Recharge — 15 PTO days per year, 9 sick days, 13 paid company holidays, and offices closed for winter break (Christmas Eve to New Years Day)!

  • 401(K) — Eligible to participate on day one of employment.

  • Parental Leave & Fertility Support

  • Competitive Salary & Equity

  • Lunch and snacks provided for in-office employees.

  • WFH equipment provided for full-time hybrid/remote employees.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Principal Platform Engineer – Infrastructure Architect

DIRECTV

San Francisco

Remote

USD 116,000 - 212,000

2 days ago
Be an early applicant

Principal Backend Software Engineer-Pipeline Infrastructure (FULLY REMOTE)

Cisco

San Jose

Remote

USD 182,000 - 252,000

4 days ago
Be an early applicant

Principal Platform Engineer – Infrastructure Architect

DIRECTV

San Diego

Remote

USD 116,000 - 212,000

-1 days ago
Be an early applicant

Principal Backend Software Engineer-Pipeline Infrastructure (FULLY REMOTE)

Splunk

California

Remote

USD 182,000 - 252,000

Yesterday
Be an early applicant

Principal Backend Software Engineer-Pipeline Infrastructure (FULLY REMOTE)

Cisco

Washington

Remote

USD 182,000 - 252,000

4 days ago
Be an early applicant

Principal Backend Software Engineer-Pipeline Infrastructure (FULLY REMOTE)

Cisco

New York

Remote

USD 203,000 - 280,000

4 days ago
Be an early applicant

Principal Backend Software Engineer-Pipeline Infrastructure (FULLY REMOTE)

Cisco

California

Remote

USD 182,000 - 252,000

4 days ago
Be an early applicant

Principal Backend Software Engineer-Pipeline Infrastructure (FULLY REMOTE)

Cisco

Seattle

Remote

USD 203,000 - 280,000

4 days ago
Be an early applicant

Principal Backend Software Engineer-Pipeline Infrastructure (FULLY REMOTE)

Cisco

Massachusetts

Remote

USD 182,000 - 252,000

4 days ago
Be an early applicant