
Research Engineer - Post-training

Magic

Seattle (WA)

On-site

USD 100,000 - 550,000

Full time

Yesterday

Job summary

A leading company is seeking a Research Engineer to develop techniques for maximizing model performance in real-world applications. The role involves leveraging data and compute to enhance software engineering tasks through autonomous AI. Candidates should have strong software engineering skills and experience with reinforcement learning techniques. The company offers a competitive salary and benefits, including equity and unlimited paid time off.

Benefits

  • 401(k) plan with 6% salary matching
  • Generous health, dental and vision insurance
  • Unlimited paid time off
  • Visa sponsorship and relocation stipend

Qualifications

  • Strong experience deploying and fine-tuning LLMs for real-world applications.
  • Thorough knowledge of the deep learning literature.

Responsibilities

  • Research and develop innovative post-training techniques and reinforcement learning strategies.
  • Build dynamic reward systems and feedback pipelines for software development.

Skills

  • Software Engineering
  • Reinforcement Learning
  • Data Quality

Job description

Magic’s mission is to build safe AGI that accelerates humanity’s progress on the world’s most important problems. We believe the most promising path to safe AGI lies in automating research and code generation to improve models and solve alignment more reliably than humans can alone. Our approach combines frontier-scale pre-training, domain-specific RL, ultra-long context, and inference-time compute to achieve this goal.

About the role:

As a Research Engineer in post-training, you’ll help develop novel techniques and datasets to maximize model performance for real-world applications, leveraging data and compute at scale. You’ll enable our models to complete engineering, code review, and software design tasks in large, real-world codebases while incorporating cutting-edge reinforcement learning (RL) methods.

What you might work on:
  • Research and develop innovative post-training techniques and reinforcement learning strategies to enable models to autonomously generate, debug, and optimize software

  • Build dynamic reward systems and feedback pipelines to align model outputs with human-like decision-making in software development

  • Scale up synthetic dataset generation and evaluations to drive iterative improvements in autonomous coding and problem-solving tasks

  • Improve model capabilities for generating substantial, high-quality, functional code

  • Design scalable approaches for evaluations and synthetic dataset generation that align with reinforcement learning objectives

  • Explore and implement novel methods to align AI behavior with human intent, ensuring reliability and performance in high-stakes environments

What we’re looking for:
  • Strong experience deploying and fine-tuning LLMs for real-world applications

  • Strong general software engineering skills

  • Thorough knowledge of the deep learning literature

  • Expertise in reinforcement learning techniques such as actor-critic, self-play or self-evaluation, or RLHF

  • Ability to come up with and evaluate novel research ideas

  • Obsession with details, reliability, and good testing to ensure data quality and integrity

  • Willingness to dive deeply into a large ML codebase to debug

  • Passion for building systems that redefine software engineering through fully autonomous AI

Magic strives to be the place where high-potential individuals can do their best work. We value quick learning and grit just as much as skill and experience.

Our culture:
  • Integrity. Words and actions should be aligned

  • Hands-on. At Magic, everyone is building

  • Teamwork. We move as one team, not N individuals

  • Focus. Safely deploy AGI. Everything else is noise

  • Quality. Magic should feel like magic

Compensation, benefits and perks (US):
  • Annual salary range: $100K - $550K

  • Equity is a significant part of total compensation, in addition to salary

  • 401(k) plan with 6% salary matching

  • Generous health, dental and vision insurance for you and your dependents

  • Unlimited paid time off

  • Visa sponsorship and relocation stipend to bring you to SF, if possible

  • A small, fast-paced, highly focused team

