Enable job alerts via email!

Machine Learning Engineer, Datasets

Runway

United States

On-site

USD 80,000 - 150,000

Full time

30+ days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An innovative company is seeking a Dataset Engineer to join their dynamic team. This role focuses on curating and optimizing large-scale datasets for model training, ensuring data quality and implementing efficient feedback loops. You will collaborate with a world-class research team to push the boundaries of content creation using advanced machine learning techniques. If you are passionate about creativity and technology, this position offers the opportunity to work on cutting-edge projects that redefine storytelling through AI. Join a forward-thinking organization committed to diversity and equal opportunity, where your contributions will make a significant impact.

Qualifications

  • 4+ years of experience in machine learning or dataset engineering.
  • Strong data analysis and SQL skills with a focus on data quality.

Responsibilities

  • Develop and maintain large-scale datasets for training models.
  • Create evaluations and benchmark analyses for datasets.

Skills

Machine Learning
Data Analysis
SQL
Data Curation
Programming
Model Optimization
Data Modeling
Creativity Tools Understanding

Education

Bachelor's Degree in Computer Science or related field

Tools

PyTorch
TensorFlow
Ray
Kubernetes
Airflow
Prefect

Job description

Member of Technical Staff, Dataset Engineering

Remote

Runway is an applied research company pioneering new tools for human imagination. Runway has been at the forefront of multi-modal AI systems ensuring that the future of media creation is accessible, controllable and empowering for creatives. Runway’s mission is to ensure that anyone anywhere can tell their stories. We believe that deep learning techniques applied to audiovisual content will forever change art, creativity and design tools.

Runway is leading a shift to generative media that is unlocking an unprecedented level of creative potential. The invention of the camera 200 years ago forever changed our world – AI is a new kind of camera that will reshape storytelling forever.

About the role

*Open to hiring remote across North America and Europe — we also have offices in NYC, San Francisco, Seattle, and London

We're looking for Dataset Engineers to help curate, build, and optimize datasets for model training. The ideal candidate for this role has strong machine learning skills, extensive experience working with and analyzing large-scale datasets, and an understanding of creativity tools. You should be proficient in ensuring data quality and tight feedback loops between data preprocessing and model training.

What you'll do
  • Develop and maintain large-scale, multimodal datasets for training and evaluating models
  • Create and run evaluations and benchmark analyses for datasets and models
  • Implement fast iteration cycles and feedback loops to continuously improve model datasets
  • Work with a world-class research team to push the boundaries of content creation
  • Evaluate new datasets and models for upstream data tasks that feed into our products
What you'll need
  • 4+ years of relevant experience in machine learning or dataset engineering, ideally with multimodal datasets
  • Experience with running and optimizing models offline at large scale
  • Excellent data modeling skills and experience with data curation
  • Proficiency in model finetuning and optimization for data preprocessing
  • Strong data analysis and SQL skills
  • Experience in creating evaluations and running benchmark analyses
  • Solid knowledge of at least one machine learning framework (e.g. PyTorch, JAX, TensorFlow)
  • Very strong programming skills and ability to write clean and maintainable code
  • Deep interest in building human-in-the-loop systems for creativity
  • Ability to rapidly prototype solutions and iterate on them with tight product deadlines
  • Strong familiarity with tools such as Ray, Kubernetes, Airflow, Prefect
  • Excellent communication, collaboration, and documentation skills

Runway strives to recruit and retain exceptional talent from diverse backgrounds while ensuring pay equity for our team. Our salary ranges are based on competitive market rates for our size, stage and industry, and salary is just one part of the overall compensation package we provide.

There are many factors that go into salary determinations, including relevant experience, skill level and qualifications assessed during the interview process, and maintaining internal equity with peers on the team. The range shared below is a general expectation for the function as posted, but we are also open to considering candidates who may be more or less experienced than outlined in the job description. In this case, we will communicate any updates in the expected salary range.

Lastly, the provided range is the expected salary for candidates in the U.S. Outside of those regions, there may be a change in the range, which again, will be communicated to candidates.

We’re committed to creating a space where our employees can bring their full selves to work and have equal opportunity to succeed. So regardless of race, gender identity or expression, sexual orientation, religion, origin, ability, age, veteran status, if joining this mission speaks to you, we encourage you to apply.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Senior Machine Learning Engineer

Infinite Reality

Remote

USD 133,000 - 151,000

4 days ago
Be an early applicant

Founding Machine Learning Engineer (Back-End Focus)

Embark On Talent Ltd

Remote

USD 100,000 - 720,000

4 days ago
Be an early applicant

Senior Machine Learning Engineer

Calix

Remote

USD 116,000 - 227,000

2 days ago
Be an early applicant

Data Scientist

Byram Healthcare

Remote

USD 80,000 - 120,000

Yesterday
Be an early applicant

Sr. Data Engineer (Databricks)

Interactive Resources - iR

Remote

USD 130,000 - 160,000

6 days ago
Be an early applicant

Lead Data Scientist - Databricks ML experience

ON Data Staffing

Remote

USD 100,000 - 720,000

4 days ago
Be an early applicant

Data Scientist

Byram Healthcare

New Mexico

Remote

USD 100,000 - 125,000

Yesterday
Be an early applicant

Data Engineer II - (Remote - US)

Jobgether

Remote

USD 120,000 - 160,000

5 days ago
Be an early applicant

Sr. Machine Learning Engineer

Mr. Cooper Group Inc.

Lewisville

Remote

USD 90,000 - 150,000

4 days ago
Be an early applicant