Enable job alerts via email!

Research Engineer, Frontier Evals

OpenAI

San Francisco (CA)

On-site

USD 200,000 - 370,000

Full time

2 days ago

Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

A leading company in AI research is seeking a Research Engineer for their Safety Systems team. This role involves identifying AI safety risks, building evaluations for frontier AI models, and contributing to risk management practices. Candidates should have a strong background in ML research and a passion for AI safety. Join us in ensuring the safe deployment of AI technologies that benefit society.

Benefits

Equity Offers

Qualifications

Experience in ML research engineering and AI safety risks.
Ability to operate in a fast-paced research environment.

Responsibilities

Identify emerging AI safety risks and methodologies.
Build evaluations of frontier AI models.
Design scalable systems for evaluations.

Skills

AI Safety

ML Research Engineering

Communication

Research Engineer, Frontier Evals | OpenAI

Careers

Research Engineer, Frontier Evals

Safety Systems - San Francisco

Apply now (opens in a new window)

About the team

The Safety Systems team is responsible for various safety work to ensure our best models can be safely deployed to the real world to benefit the society and is at the forefront of OpenAI's mission to build and deploy safe AGI, driving our commitment to AI safety and fostering a culture of trust and transparency.

Frontier AI models have the potential to benefit all of humanity, but also pose increasingly severe risks. To ensure that AI promotes positive change, the Frontier Evals team helps us prepare for the development of increasingly capable frontier AI models. This team is tasked with identifying, tracking, and preparing for catastrophic risks related to frontier AI models.

Specifically, the mission of the Frontier Evals team is to:

Closely monitor and predict the evolving capabilities of frontier AI systems, with an eye towards misuse risks whose impact could be catastrophic (not necessarily existential) to our society; and

Ensure we have concrete procedures, infrastructure and partnerships to mitigate these risks and, more broadly, to safely handle the development of powerful AI systems.

Our team will tightly connect capability assessment, evaluations, and internal red teaming for frontier models, as well as overall coordination on AGI preparedness. The team’s core goal is to ensure that we have the infrastructure needed for the safety of highly-capable AI systems—from the models we develop in the near future to those with AGI-level capabilities.

About you

We are looking to hire exceptional research engineers that can push the boundaries of our frontier models. Specifically, we are looking for those that will help us shape our empirical grasp of the whole spectrum of AI safety concerns and will own individual threads within this endeavor end-to-end.

In this role, you'll:

Work on identifying emerging AI safety risks and new methodologies for exploring the impact of these risks
Build (and then continuously refine) evaluations of frontier AI models that assess the extent of identified risks
Design and build scalable systems and processes that can support these kinds of evaluations
Contribute to the refinement of risk management and the overall development of "best practice" guidelines for AI safety evaluations

We expect you to be:

Passionate and knowledgeable about short-term and long-term AI safety risks
Able to think outside the box and have a robust “red-teaming mindset”
Experienced in ML research engineering, ML observability and monitoring, creating large language model-enabled applications, and/or another technical domain applicable to AI risk
Able to operate effectively in a dynamic and extremely fast-paced research environment as well as scope and deliver projects end-to-end

It would be great if you also have:

First-hand experience in red-teaming systems—be it computer systems or otherwise
A good understanding of the (nuances of) societal aspects of AI deployment
An ability to work cross-functionally
Excellent communication skills

About OpenAI

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.

We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or any other legally protected status.

OpenAI Affirmative Action and Equal Employment Opportunity Policy Statement

For US Based Candidates: Pursuant to the San Francisco Fair Chance Ordinance, we will consider qualified applicants with arrest and conviction records.

We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via thislink .

OpenAI Global Applicant Privacy Policy

At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.

Compensation

$200K – $370K + Offers Equity

Apply now (opens in a new window)

Get your free, confidential resume review.

or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

(Senior) Machine Learning Research Engineer, Healthcare Data - Remote

Freenome

South San Francisco

Remote

USD 157,000 - 240,000

30+ days ago

Research Engineer, Post-Training Evals

OpenAI

San Francisco

Hybrid

USD 295,000 - 530,000

10 days ago

Systems Research Engineer, GPU Programming

CRM Hike

San Francisco

Remote

USD 160,000 - 230,000

30+ days ago

Research Engineer, Tokens ML Infra

Anthropic

San Francisco

Hybrid

USD 315,000 - 425,000

4 days ago

Be an early applicant

Machine Learning Research Engineer, Natural Language Generation (NLG), Apple Intelligence

Apple

Cupertino

On-site

USD 143,000 - 265,000

5 days ago

Be an early applicant

Multimodal Generative Modeling Research Engineer - SIML, ISE

Apple

Cupertino

On-site

USD 175,000 - 313,000

6 days ago

Be an early applicant

Machine Learning Research Engineer

Quantix Search

San Francisco

On-site

USD 250,000 - 300,000

11 days ago