Enable job alerts via email!

Systems Research Engineer, Machine Learning Systems

CRM Hike

San Francisco (CA)

Remote

USD 160,000 - 230,000

Full time

30+ days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An innovative AI company is seeking a Systems Research Engineer to advance their machine learning systems. In this pivotal role, you will collaborate with diverse teams to design and optimize large-scale distributed training systems and a high-performance inference engine. Your expertise in machine learning will help push the boundaries of AI technology, ensuring the platform remains at the forefront of innovation. This position offers competitive compensation, equity, and the flexibility of remote work, making it an exciting opportunity for those passionate about shaping the future of AI.

Benefits

Startup Equity
Health Insurance
Flexible Remote Work
Competitive Compensation

Qualifications

  • Strong background in machine learning systems and distributed learning.
  • Knowledge of ML/AI applications, especially large language models.

Responsibilities

  • Optimize training and inference platforms for better performance.
  • Collaborate with teams to integrate research into software systems.

Skills

Machine Learning Systems
Distributed Learning
Inference Optimization
Problem-Solving Skills
Analytical Skills

Education

Bachelor's Degree in Computer Science
Master's Degree in Electrical Engineering
Ph.D. in Relevant Field

Tools

Performance Profiling Tools

Job description

Systems Research Engineer, Machine Learning Systems

Role

As a Systems Research Engineer specialized in Machine Learning Systems, you will play a crucial role in researching and building the next generation AI platform at Together. Working closely with the modeling, algorithm, and engineering teams, you will design large-scale distributed training systems and a low-latency/high-throughput inference engine that serves a diverse, rapidly growing user base. Your research skills will be vital in staying up-to-date with the latest advancements in machine learning systems, ensuring that our AI infrastructure remains at the forefront of innovation.

Requirements

  • Strong background in machine learning systems, such as distributed learning and efficient inference for large language models and diffusion models
  • Knowledge of ML/AI applications and models, especially foundation models such as large language models and diffusion models, how they are constructed and how they are used
  • Knowledge of system performance profiling and optimization tools for ML systems
  • Excellent problem-solving and analytical skills
  • Bachelor's, Master's, or Ph.D. degree in Computer Science, Electrical Engineering, or equivalent practical experience

Responsibilities

  • Optimize and fine-tune existing training and inference platform to achieve better performance and scalability
  • Collaborate with cross-functional teams to integrate cutting edge research ideas into existing software systems
  • Develop your own ideas of optimizing the training and inference platforms and push the frontier of machine learning systems research
  • Stay up-to-date with the latest advancements in machine learning systems techniques and apply many of them to the Together platform

About Together AI

Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancement such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers in our journey in building the next generation AI infrastructure.

Compensation

We offer competitive compensation, startup equity, health insurance, and other benefits, as well as flexibility in terms of remote work. The US base salary range for this full-time position is: $160,000 - $230,000 + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge.

Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Machine Learning Engineer

Willing Tech

San Francisco

Remote

USD 200,000 - 300,000

Today
Be an early applicant

Senior Machine Learning Engineer

Fieldguide

San Francisco

Remote

USD 185,000 - 285,000

4 days ago
Be an early applicant

Senior Staff Engineer - Finance Data Specialist (Remote)

GEICO

San Francisco

Remote

USD 130,000 - 260,000

4 days ago
Be an early applicant

Staff Machine Learning Engineer

VSCO

San Francisco

Remote

USD 225,000 - 255,000

5 days ago
Be an early applicant

Data Engineer

Protecht

San Francisco

Remote

USD 173,000 - 242,000

9 days ago

Senior Machine Learning Engineer

Censys

Los Altos

Remote

USD 182,000 - 228,000

6 days ago
Be an early applicant

Founding Machine Learning Engineer, AI

Recruiting From Scratch

San Francisco

Remote

USD 180,000 - 250,000

3 days ago
Be an early applicant

Senior Software Engineer - Data Lakehouse Infrastructure

TRM Labs

San Francisco

Remote

USD 190,000 - 220,000

12 days ago

Machine Learning Engineer, Core Engineering

Pinterest

California

Remote

USD 129,000 - 268,000

11 days ago