Enable job alerts via email!

AI/ML Research Engineer - LLM Post-Training with RL

P-1 AI

San Francisco (CA)

On-site

USD 120,000 - 160,000

Full time

23 days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

A leading company in AI is seeking a Research Engineer to develop advanced AI systems capable of quantitative reasoning. The role involves training LLMs and requires strong programming and machine learning skills. Ideal candidates will thrive in a dynamic startup environment and have a rigorous academic background.

Qualifications

Strong programming skills and deep understanding of machine learning.
Experience with LLM post-training focused on RL.

Responsibilities

Responsible for RLEF training of LLM/agentic systems.
Build AI systems with quantitative reasoning capabilities.

Skills

Programming

Machine Learning

Debugging

Quantitative Reasoning

Education

Background in Statistical Machine Learning

Background in Physics

Background in Mathematics

Tools

Python

PyTorch

C++

> about P-1 AI

We are building an engineering AGI. We founded P-1 AI with the conviction that the greatest impact of artificial intelligence will be on the built world—helping mankind conquer nature and bend it to our will. Our first product is Archie, an AI engineer capable of quantitative and spatial reasoning over physical product domains that performs at the level of an entry-level design engineer. We aim to put an Archie on every engineering team at every industrial company on earth.

Our founding team includes the top minds in deep learning, model-based engineering, and industries that are our customers. We just closed a $23 million seed round led by Radical Ventures that includes a number of other AI and industrial luminaries (from OpenAI, DeepMind, etc.).

> about the role

As a Research Engineer here, you will be responsible for RLEF training (Reinforcement Learning from Execution Feedback) of our LLM/agentic systems and helping us build an AI system with quantitative reasoning capability that can perform previously impossible tasks or achieve unprecedented levels of performance in the domain of designing physical systems.

> tech stack

Python
PyTorch
C++

> we expect you to

have strong programming skills and a deep understanding of machine learning
experience working with large distributed systems
be comfortable diving into a large ML codebase to debug
have a deep understanding of LLM architectures
have experience with LLM post-training with a focus on RL
execute and analyze experiments autonomously and collaboratively
be excited about the prospect of building an engineering AGI

> ++

you’ve built a popular/impactful open source project
you’ve published or co-authored LLM/RL papers
keep up with state-of-the-art LLM research

> you will thrive in this role if

you have a background in statistical machine learning, physics, mathematics, or another theoretically rigorous field
you are intellectually curious and quick to pick up concepts outside your area of expertise
you love working in a dynamic, fast-paced startup environment

> interview process

Initial screening - with Head of Talent (30 mins)
Hiring manager interview - with co-founder & Head of AI (30 mins)
Technical interview 1 (60 mins)
Technical interview 2 (60 mins)
Culture fit / Q&A (possibly in-person) - with co-founder & CEO

Get your free, confidential resume review.

or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.