Enable job alerts via email!

Member of Technical Staff - Foundational Model Data

Liquid AI

San Francisco (CA)

On-site

USD 60,000 - 240,000

Full time

Yesterday
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

Liquid AI is seeking a highly skilled Machine Learning Engineer to enhance its foundational model data processes. The successful candidate will be responsible for managing large datasets and developing pipelines for data generation, making a significant impact on AI system efficiency and effectiveness.

Qualifications

  • B.S. + 5 years experience or M.S. + 3 years experience or Ph.D. + 1 year.
  • Expertise in data curation, cleaning, and augmentation.
  • Strong programming skills in Python.

Responsibilities

  • Create and maintain data cleaning and filtering pipelines for large datasets.
  • Gather datasets from the web and maintain synthetic data generation pipelines.
  • Run ablation studies to evaluate dataset quality.

Skills

Data curation
Data cleaning
Data augmentation
Synthetic data generation
Programming in Python
Machine Learning frameworks
Debugging models
Working with LLMs

Education

B.S. in relevant field
M.S. in Computer Science/Engineering
Ph.D. in relevant field

Job description

Member of Technical Staff - Machine Leaning Engineer; Foundational Model Data

Join to apply for the Member of Technical Staff - Machine Leaning Engineer; Foundational Model Data role at Liquid AI

Member of Technical Staff - Machine Leaning Engineer; Foundational Model Data

1 month ago Be among the first 25 applicants

Join to apply for the Member of Technical Staff - Machine Leaning Engineer; Foundational Model Data role at Liquid AI

Liquid AI, an MIT spin-off, is a foundation model company headquartered in Boston, Massachusetts. Our mission is to build capable and efficient general-purpose AI systems at every scale.

Our goal at Liquid is to build the most capable AI systems to solve problems at every scale, such that users can build, access, and control their AI solutions. This is to ensure that AI will get meaningfully, reliably and efficiently integrated at all enterprises. Long term, Liquid will create and deploy frontier-AI-powered solutions that are available to everyone.

We are seeking a highly skilled Member of Technical Staff, Foundation Model Data to play a critical role in our foundation model development process. This role focuses on consolidating, gathering, and generating high-quality text data for pretraining, midtraining, SFT, and preference optimization.

Key Responsibilities

  • Create and maintain data cleaning, filtering, selection pipeline than can handle >100TB of data
  • Watch out for the release of public dataset on huggingface and other platforms
  • Create crawlers to gather datasets from the web where public data is lacking
  • Write and maintain synthetic data generation pipelines
  • Run ablations to assess new dataset and judging pipelines



Required Qualifications

  • Experience Level: B.S. + 5 years experience or M.S. + 3 years experience or Ph.D. + 1 year of experience
  • Dataset Engineering: Expertise in data curation, cleaning, augmentation, and synthetic data generation techniques
  • Machine Learning Expertise: Ability to write and debug models in popular ML frameworks, and experience working with LLMs
  • Software Development: Strong programming skills in Python, with an emphasis on writing clean, maintainable, and scalable code



Preferred Qualifications

  • M.S. or Ph.D. in Computer Science, Electrical Engineering, Math, or a related field
  • Experience fine-tuning or customizing LLMs
  • First-author publications in top ML conferences (e.g. NeurIPS, ICML, ICLR)
  • Contributions to popular open-source projects

Seniority level
  • Seniority level
    Not Applicable
Employment type
  • Employment type
    Full-time
Job function
  • Job function
    Engineering and Information Technology
  • Industries
    Software Development

Referrals increase your chances of interviewing at Liquid AI by 2x

Get notified about new Member of Technical Staff jobs in San Francisco, CA.

Berkeley, CA $60,000.00-$240,000.00 1 year ago

Consulting Member of Technical Staff (IC5)
Member of Technical Staff - Computational Biologist
Member of Technical Staff - Head of Engineering

San Francisco, CA $101,500.00-$156,750.00 2 months ago

Alameda, CA $96,000.00-$137,500.00 13 hours ago

San Francisco, CA $136,947.00-$239,699.00 5 months ago

San Francisco, CA $90,000.00-$130,000.00 6 months ago

Associate Director of Counseling & Psychological Services - (Administrator II) - Counseling and Psychological Services

San Francisco, CA $10,000.00-$120,000.00 7 months ago

Project Archaeologist/ Cultural Resources Specialist

San Francisco, CA $141,800.00-$221,600.00 10 hours ago

San Mateo, CA $141,800.00-$221,600.00 11 hours ago

San Francisco, CA $116,000.00-$200,100.00 1 day ago

Member of Technical Staff (Senior/Staff)

San Francisco, CA $145,000.00-$220,000.00 3 months ago

Member of Technical Staff - General Interest
Member of Technical Staff - Compute Platform
Quantum Engineer - Member of Technical Staff

San Francisco, CA $120,000.00-$180,000.00 1 month ago

Member of Technical Staff, Founding Design Engineer

San Francisco, CA $130,000.00-$200,000.00 5 months ago

Member of Technical Staff, Founding Frontend Engineer

San Francisco, CA $130,000.00-$200,000.00 4 months ago

Member of Technical Staff, Founding Backend Engineer

San Francisco, CA $150,000.00-$200,000.00 5 months ago

San Francisco, CA $160,000.00-$175,000.00 4 days ago

San Francisco, CA $200,000.00-$260,000.00 2 weeks ago

San Francisco, CA $200,000.00-$260,000.00 4 days ago

San Francisco, CA $160,000.00-$175,000.00 2 days ago

We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.