Enable job alerts via email!

Research Engineer - Distributed Training

Prime Intellect

United States

On-site

USD 120,000 - 180,000

Full time

6 days ago
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

Prime Intellect is seeking a Research Engineer to drive advancements in decentralized AI model training. This role focuses on optimizing performance and creating a decentralized training orchestration solution while collaborating on open-source projects. Join us to contribute to a mission that aims to democratize AI technology and build an innovative platform for researchers and developers.

Benefits

Competitive compensation including equity and token incentives
Flexible work arrangements
Visa sponsorship and relocation assistance
Quarterly team off-sites and hackathons

Qualifications

  • Strong background in AI/ML engineering.
  • Experience with large-scale model training and distributed training techniques.
  • Solid understanding of MLOps.

Responsibilities

  • Lead research to build decentralized training orchestration solutions.
  • Optimize AI workloads for performance and cost.
  • Contribute to and publish in top-tier AI conferences.

Skills

AI/ML engineering
End-to-end pipeline design
Distributed training techniques
MLOps best practices
Performance optimization

Education

Relevant Bachelor's or Master's degree

Tools

PyTorch
DeepSpeed
Ray

Job description

Join to apply for the Research Engineer - Distributed Training role at Prime Intellect

1 year ago Be among the first 25 applicants

Join to apply for the Research Engineer - Distributed Training role at Prime Intellect

At Prime Intellect, we are on a mission to accelerate open and decentralized AI progress by enabling anyone to contribute compute, code or capital to train powerful, open models. Our ultimate goal? Openly accessible AGI that benefits everyone. But we can't do it alone and we want to do this together with you.

We are building the infrastructure for decentralized AI development at scale. We aggregate global compute and enable researchers to collaboratively train state-of-the-art models through distributed training across clusters.

As a Research Engineer working on Distributed Training, you'll play a crucial role in shaping our technological direction, focusing on our decentralizing AI training stack. If you love scaling things and maximizing training efficiency, this role is for you.

Responsibilities

  • Lead and participate in novel research to build a massive scale, highly reliable and secure decentralized training orchestration solution
  • Optimize the performance, cost, and resource utilization of AI workloads by leveraging the most recent advances for compute & memory optimization techniques.
  • Contribute to the development of our open-source libraries and frameworks for distributed model training.
  • Publish research in top-tier AI conferences such as ICML & NeurIPS.
  • Distill highly technical project outcomes in layman approachable technical blogs to our customers and developers.
  • Stay up-to-date with the latest advancements in AI/ML infrastructure and tools, decentralized training research and proactively identify opportunities to enhance our platform's capabilities and user experience.

Requirements

  • Strong background in AI/ML engineering, with extensive experience in designing and implementing end-to-end pipelines for training and deploying large-scale AI models.
  • Deep expertise in distributed training techniques, frameworks (e.g., PyTorch Distributed, DeepSpeed, MosaicML’s LLM Foundry), and tools (e.g. Ray) for optimizing the performance and scalability of AI workloads.
  • Experience in large-scale model training incl. distributed training techniques such as data, tensor & pipeline parallelism
  • Solid understanding of MLOps best practices, including model versioning, experiment tracking, and continuous integration/deployment (CI/CD) pipelines.
  • Passion for advancing the state-of-the-art in decentralized AI model training and democratizing access to AI capabilities for researchers, developers, and businesses worldwide.
  • If you're not familiar with these, but feel like that you can contribute to our mission and you're a high-energy person, get familiar with these resources (here, here and here) and please reach out!

Benefits & Perks

  • Competitive compensation, including equity and token incentives, aligning your success with the growth and impact of Prime Intellect.
  • Flexible work arrangements, with the option to work remotely or in-person at our offices in San Francisco.
  • Visa sponsorship and relocation assistance for international candidates.
  • Quarterly team off-sites, hackathons, conferences and learning opportunities.
  • Opportunity to work with a talented, hard-working and mission-driven team, united by a shared passion for leveraging technology to accelerate science and AI.

We recently raised $15mm in funding (total of $20mm raised) led by Founders Fund, with participation from Menlo Ventures and prominent angels including Andrej Karpathy (Eureka AI, Tesla, OpenAI), Tri Dao (Chief Scientific Officer of Together AI), Dylan Patel (SemiAnalysis), Clem Delangue (Huggingface), Emad Mostaque (Stability AI) and many others.

If you're excited about the opportunity to build the foundation for the future of decentralized AI and create a platform that empowers developers and researchers to push the boundaries of what's possible, we'd love to hear from you.

Seniority level
  • Seniority level
    Mid-Senior level
Employment type
  • Employment type
    Full-time
Job function
  • Job function
    Engineering and Information Technology
  • Industries
    Software Development

Referrals increase your chances of interviewing at Prime Intellect by 2x

Get notified about new Research Engineer jobs in United States.

Software Engineer - Fullstack, Multiple Locations

United States $81,900.00-$174,600.00 1 week ago

Full-Stack Software Engineer (New graduates: United States)

United States $70,000.00-$100,000.00 2 weeks ago

Software Engineer (L5) - Open Connect Platform

New York, NY $140,000.00-$185,000.00 5 days ago

Software Engineer (L5) - Ads Identity & Privacy

United States $100,000.00-$720,000.00 1 week ago

Software Engineer L4/L5, Model Serving Systems, Machine Learning Platform

United States $100,000.00-$720,000.00 1 week ago

New York, United States $142,600.00-$196,200.00 5 days ago

Software Engineer Intern/Co-op (Fall 2025)

United States $140,000.00-$170,000.00 1 month ago

New York, NY $145,000.00-$260,000.00 7 months ago

United States $90,000.00-$170,000.00 9 months ago

Fort Myers, FL $80,000.00-$100,000.00 13 hours ago

Software Engineer - AI/ML, Multiple Locations

United States $81,900.00-$174,600.00 1 week ago

Orange County, CA $75,000.00-$85,000.00 20 hours ago

Software Engineer - AI/ML, Multiple Locations

Redmond, WA $81,900.00-$174,600.00 1 week ago

Software Engineering Intern (September 2025)

We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Research Engineer

Foundation EGI

On-site

USD 150 000 - 300 000

6 days ago
Be an early applicant

Sr. Threat Detection Research Engineer (Remote)

CrowdStrike

Remote

USD 135 000 - 215 000

3 days ago
Be an early applicant

Threat Detection Research Engineer (Remote)

CrowdStrike

Remote

USD 110 000 - 180 000

3 days ago
Be an early applicant

[Hiring] Senior Research Engineer @Output

併瑰畴

Remote

USD 155 000 - 195 000

24 days ago

Senior Staff Algorithm Research Engineer

Insulet Corporation

Massachusetts

Remote

USD 142 000 - 215 000

18 days ago

RESEARCH SCIENTIST/ENGINEER 4

University of Washington

Seattle

Remote

USD 150 000 - 200 000

4 days ago
Be an early applicant

Senior Research Software Engineer

Source One Technical Solutions

Los Altos

Remote

USD 150 000 - 200 000

5 days ago
Be an early applicant

Machine Learning Research Engineer

Qualcomm

San Diego

On-site

USD 179 000 - 269 000

4 days ago
Be an early applicant

Senior Research Engineer, Computer Vision

Autodesk

Remote

USD 100 000 - 130 000

11 days ago