Enable job alerts via email!

Machine Learning Research Scientist / Research Engineer, Science of Data

Enclustra

New York, San Francisco (IA, CA)

On-site

USD 176,000 - 255,000

Full time

30+ days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An innovative firm is seeking talented Research Scientists and Engineers to push the boundaries of data science in generative AI. This role involves developing cutting-edge methodologies for synthetic and hybrid data generation, ensuring high-quality datasets that drive the next generation of AI capabilities. You will collaborate with leading researchers and engineers, publish your findings in top-tier conferences, and contribute to open-source initiatives. Join a forward-thinking team that values diversity and is committed to transforming industries through AI. If you're passionate about advancing the science of data, this opportunity is for you.

Benefits

Comprehensive health coverage
Dental and vision coverage
Retirement benefits
Learning and development stipend
Generous PTO
Commuter stipend

Qualifications

  • Strong background in deep learning and data-centric AI methodologies.
  • Experience in synthetic data generation and data selection.

Responsibilities

  • Develop and refine synthetic and hybrid data generation methods.
  • Collaborate with teams to establish best practices for AI datasets.

Skills

Deep Learning
Machine Learning
Data Generation
Data Quality Assessment
Communication Skills

Education

Ph.D. in Computer Science
Master's degree in AI

Tools

Python
PyTorch
TensorFlow

Job description

Machine Learning Research Scientist / Research Engineer, Science of Data

Scale works with the industry’s leading AI model labs to provide high quality data and accelerate progress in GenAI research. We are dedicated to advancing the science of data for generative AI. We develop innovative techniques for hybrid data generation and data quality assessment, ensuring high-quality and diverse datasets to drive the next generation of AI capabilities.

We are looking for Research Scientists and Research Engineers to advance the science of data and tackle challenges in data generation, quality assessment, and data selection for large-scale AI models. In this role, you will research and develop methodologies for synthetic and hybrid data generation, data quality and diversity analysis, and annotator behavior modeling. You will collaborate with researchers and engineers to define best practices in data-driven AI development. You will also partner with top foundation model labs to provide both technical and strategic input on the development of the next generation of generative AI models.

You will:

  • Develop and refine synthetic and hybrid (with human-in-the-loop) data generation methods to enhance model training.
  • Design and implement data quality frameworks, including data diversity analysis, data selection strategies, and detection of reward hacking.
  • Collaborate with internal teams and external partners to establish best practices for high-quality AI datasets.
  • Publish research findings in top-tier AI conferences and contribute to open-source data quality initiatives.

Ideally you’d have:

  • Ph.D., Master's degree/or equivalent experience in Computer Science, Machine Learning, AI, or a related field.
  • Strong background in deep learning, LLM, and data-centric AI methodologies.
  • Experience in synthetic data generation, data selection, reward hacking detection, human-in-the-loop data orc, and annotator behavior research.
  • Proficiency in Python and ML frameworks such as PyTorch or TensorFlow.
  • Excellent written and verbal communication skills.
  • Published research in areas of machine learning at major conferences (NeurIPS, ICML, ICLR, ACL, EMNLP, CVPR, etc.) and/or journals.
  • Previous experience in a customer facing role.

Compensation packages at Scale for eligible roles include base salary, equity, and benefits. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position, determined by work location and additional factors, including job-related skills, experience, interview performance, and relevant education or training. Scale employees in eligible roles are also granted equity based compensation, subject to Board of Director approval. Your recruiter can share more about the specific salary range for your preferred location during the hiring process, and confirm whether the hired role will be eligible for equity grant. You’ll also receive benefits including, but not limited to: Comprehensive health, dental and vision coverage, retirement benefits, a learning and development stipend, and generous PTO. Additionally, this role may be eligible for additional benefits such as a commuter stipend.

The base salary range for this full-time position in the location of San Francisco is:

$176,000 - $255,000 USD

PLEASE NOTE:Our policy requires a 90-day waiting period before reconsidering candidates for the same role. This allows us to ensure a fair and thorough evaluation of all applicants.

About Us:

At Scale, we believe that the transition from traditional software to AI is one of the most important shifts of our time. Our mission is to make that happen faster across every industry, and our team is transforming how organizations build and deploy AI. Our products power the world's most advanced LLMs, generative models, and computer vision models. We are trusted by generative AI companies such as OpenAI, Meta, and Microsoft, government agencies like the U.S. Army and U.S. Air Force, and enterprises including GM and Accenture. We are expanding our team to accelerate the development of AI applications.

We believe that everyone should be able to bring their whole selves to work, which is why we are proud to be an inclusive and equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability status, gender identity or Veteran status.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Staff Data Scientist / Machine Learning Engineer - Ads Kitchener-Waterloo, ON; Toronto, ON

Faire

San Francisco

Hybrid

USD 196,000 - 270,000

25 days ago

Senior Data Scientist / Machine Learning Engineer - Personalization Kitchener-Waterloo, ON; Tor[...]

Faire

San Francisco

Hybrid

USD 168,000 - 231,000

25 days ago

Machine Learning Research Scientist/ Engineer, Agents

Scale AI

New York

On-site

USD 200,000 - 251,000

30+ days ago

Senior Data Scientist / Machine Learning Engineer - Search & Recommendation Kitchener-Waterloo,[...]

Faire

San Francisco

Hybrid

USD 168,000 - 231,000

25 days ago

Senior Data Scientist - Retailer Toronto, ON

Faire

San Francisco

Hybrid

USD 168,000 - 231,000

25 days ago

ML Research Engineer, ML Systems

Tbwa Chiat/Day Inc

New York

On-site

USD 200,000 - 251,000

30+ days ago

Senior Data Scientist I

Carta

New York

On-site

USD 170,000 - 200,000

30+ days ago

Associate Principal Scientist (Associate Director) - Real-world Data Analytics and Innovation

Hispanic Alliance for Career Enhancement

West Point

Hybrid

USD 139,000 - 220,000

16 days ago