Enable job alerts via email!

Research Engineer, Frontier Evals

OpenAI

San Francisco (CA)

On-site

USD 200,000 - 370,000

Full time

2 days ago
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

A leading company in AI research is seeking a Research Engineer for their Safety Systems team. This role involves identifying AI safety risks, building evaluations for frontier AI models, and contributing to risk management practices. Candidates should have a strong background in ML research and a passion for AI safety. Join us in ensuring the safe deployment of AI technologies that benefit society.

Benefits

Equity Offers

Qualifications

  • Experience in ML research engineering and AI safety risks.
  • Ability to operate in a fast-paced research environment.

Responsibilities

  • Identify emerging AI safety risks and methodologies.
  • Build evaluations of frontier AI models.
  • Design scalable systems for evaluations.

Skills

AI Safety
ML Research Engineering
Communication

Job description

Research Engineer, Frontier Evals | OpenAI

Careers

Research Engineer, Frontier Evals

Safety Systems - San Francisco

Apply now (opens in a new window)

About the team

The Safety Systems team is responsible for various safety work to ensure our best models can be safely deployed to the real world to benefit the society and is at the forefront of OpenAI's mission to build and deploy safe AGI, driving our commitment to AI safety and fostering a culture of trust and transparency.

Frontier AI models have the potential to benefit all of humanity, but also pose increasingly severe risks. To ensure that AI promotes positive change, the Frontier Evals team helps us prepare for the development of increasingly capable frontier AI models. This team is tasked with identifying, tracking, and preparing for catastrophic risks related to frontier AI models.

Specifically, the mission of the Frontier Evals team is to:

  • Closely monitor and predict the evolving capabilities of frontier AI systems, with an eye towards misuse risks whose impact could be catastrophic (not necessarily existential) to our society; and

  • Ensure we have concrete procedures, infrastructure and partnerships to mitigate these risks and, more broadly, to safely handle the development of powerful AI systems.

  • Our team will tightly connect capability assessment, evaluations, and internal red teaming for frontier models, as well as overall coordination on AGI preparedness. The team’s core goal is to ensure that we have the infrastructure needed for the safety of highly-capable AI systems—from the models we develop in the near future to those with AGI-level capabilities.

    About you

    We are looking to hire exceptional research engineers that can push the boundaries of our frontier models. Specifically, we are looking for those that will help us shape our empirical grasp of the whole spectrum of AI safety concerns and will own individual threads within this endeavor end-to-end.

    In this role, you'll:

    • Work on identifying emerging AI safety risks and new methodologies for exploring the impact of these risks

    • Build (and then continuously refine) evaluations of frontier AI models that assess the extent of identified risks

    • Design and build scalable systems and processes that can support these kinds of evaluations

    • Contribute to the refinement of risk management and the overall development of "best practice" guidelines for AI safety evaluations

    We expect you to be:

    • Passionate and knowledgeable about short-term and long-term AI safety risks

    • Able to think outside the box and have a robust “red-teaming mindset”

    • Experienced in ML research engineering, ML observability and monitoring, creating large language model-enabled applications, and/or another technical domain applicable to AI risk

    • Able to operate effectively in a dynamic and extremely fast-paced research environment as well as scope and deliver projects end-to-end

    It would be great if you also have:

    • First-hand experience in red-teaming systems—be it computer systems or otherwise

    • A good understanding of the (nuances of) societal aspects of AI deployment

    • An ability to work cross-functionally

    • Excellent communication skills

    About OpenAI

    OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.

    We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or any other legally protected status.

    OpenAI Affirmative Action and Equal Employment Opportunity Policy Statement

    For US Based Candidates: Pursuant to the San Francisco Fair Chance Ordinance, we will consider qualified applicants with arrest and conviction records.

    We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via thislink .

    OpenAI Global Applicant Privacy Policy

    At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.

    Compensation

    $200K – $370K + Offers Equity

    Apply now (opens in a new window)

    Get your free, confidential resume review.
    or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

    Similar jobs

    (Senior) Machine Learning Research Engineer, Healthcare Data - Remote

    Freenome

    South San Francisco

    Remote

    USD 157,000 - 240,000

    30+ days ago

    Research Engineer, Post-Training Evals

    OpenAI

    San Francisco

    Hybrid

    USD 295,000 - 530,000

    10 days ago

    Systems Research Engineer, GPU Programming

    CRM Hike

    San Francisco

    Remote

    USD 160,000 - 230,000

    30+ days ago

    Research Engineer, Tokens ML Infra

    Anthropic

    San Francisco

    Hybrid

    USD 315,000 - 425,000

    4 days ago
    Be an early applicant

    Machine Learning Research Engineer, Natural Language Generation (NLG), Apple Intelligence

    Apple

    Cupertino

    On-site

    USD 143,000 - 265,000

    5 days ago
    Be an early applicant

    Multimodal Generative Modeling Research Engineer - SIML, ISE

    Apple

    Cupertino

    On-site

    USD 175,000 - 313,000

    6 days ago
    Be an early applicant

    Machine Learning Research Engineer

    Quantix Search

    San Francisco

    On-site

    USD 250,000 - 300,000

    11 days ago

    Research Engineer

    Helm.ai

    Redwood City

    Remote

    USD 150,000 - 250,000

    30+ days ago

    Research Engineer

    Decagon

    San Francisco

    On-site

    USD 180,000 - 270,000

    4 days ago
    Be an early applicant