Enable job alerts via email!

Research Engineer (Pre-training)

HartleyCo

San Francisco (CA)

On-site

USD 175,000 - 250,000

Full time

30+ days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An innovative firm is seeking a Research Engineer in Pre-Training to enhance AI models through advanced techniques. In this pivotal role, you will design and optimize pre-training methods, conduct large-scale experiments, and collaborate with cross-functional teams to bring cutting-edge AI solutions to real-world applications. This fully on-site position in San Francisco offers a unique opportunity to work at the forefront of AI research and development, making significant contributions to the field. If you are passionate about AI and eager to tackle complex challenges, we invite you to apply and be part of a world-class team pushing the boundaries of technology.

Benefits

Relocation Assistance

Qualifications

Hands-on experience with large-scale models and ML fundamentals.
Ability to deploy pre-trained models in production environments.

Responsibilities

Design and optimize novel pre-training methods for models.
Conduct large-scale training experiments and analyze results.
Collaborate with teams to refine models for applications.

Skills

Machine Learning Fundamentals

Python

JAX

PyTorch

TensorFlow

Problem-Solving

Tools

Megatron

DeepSpeed

MaxText

This range is provided by HartleyCo. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more.

Base pay range

$175,000.00/yr - $250,000.00/yr

About Our Client:

Our client is assembling a world-class team to push the boundaries of AI research and development. Their mission is to build models that generalize and adapt to novel problems, bridging cutting-edge research with real-world applications.

The Role:

As a Research Engineer in Pre-Training, you'll develop and implement advanced pre-training techniques to enhance foundation models. Working closely with Research Scientists, you'll experiment with model architectures, training strategies, and large-scale data pipelines to drive AI innovation.

What You’ll Do:

Design and optimize novel pre-training methods to improve model performance.
Conduct large-scale training experiments and analyze results.
Collaborate with data and product teams to refine models for real-world applications.
Build scalable pre-training pipelines for massive datasets.

What We’re Looking For:

Strong ML fundamentals and hands-on experience with large-scale models.
Proficiency in Python, JAX, PyTorch, or TensorFlow.
Experience with frameworks like Megatron, DeepSpeed, or MaxText.
Problem-solving mindset with a research-driven approach.
Ability to deploy pre-trained models in production environments.

This is a fully on-site role in San Francisco, with relocation assistance available. If you're passionate about AI research, we’d love to hear from you!

Seniority level

Mid-Senior level

Employment type

Full-time

Job function

Information Technology

Industries

IT Services and IT Consulting

Get your free, confidential resume review.

or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Research Engineer, Tokens (Pre-training)

Freddie Mac

San Francisco

Hybrid

USD 150,000 - 425,000

Today

Be an early applicant

Research Engineer, Language - Generative AI

San Francisco

On-site

USD 70,000 - 208,000

11 days ago

AI Research Engineer, VLM, Autonomy & Robotics

Tesla

Palo Alto

On-site

USD 124,000 - 420,000

14 days ago

Research Engineer, Language - Generative AI

San Francisco

On-site

USD 8,000 - 251,000

14 days ago