Job Search and Career Advice Platform

Enable job alerts via email!

Machine Learning Data Engineer, Replica Pipelines

Parallel Domain

Vancouver

Hybrid

CAD 130,000 - 160,000

Full time

11 days ago

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A tech company specializing in AI is looking for a Machine Learning Data Engineer to support the development of their Replica product. In this role, you'll build and scale data pipelines that normalize and validate data for machine learning models. The ideal candidate will have a strong background in data engineering, 3D concepts, and proficiency in Python. You'll work closely with ML engineers and have the opportunity to contribute to cutting-edge developments in autonomous systems. Competitive compensation is offered, ranging from $130K to $160K/year.

Benefits

Competitive compensation
Collaborative culture
Professional growth opportunities
Impactful work

Qualifications

  • Proven experience building scalable data pipelines and tooling.
  • Understanding of how data is used in model training and evaluation.
  • Practical experience with 3D concepts and computer vision principles.
  • Strong Python proficiency and comfort with large datasets.
  • Experience working closely with ML engineers on data needs.

Responsibilities

  • Build reliable pipelines to normalize and validate data.
  • Create schemas, validation checks, and quality metrics for datasets.
  • Implement tools for dataset filtering, versioning, and annotation support.
  • Generate high-quality data feeds for ML training and evaluation.

Skills

Data engineering experience
ML-aware engineering
3D Foundations
Strong Python proficiency
Collaborative mindset

Education

MS or PhD in ML, computer vision, robotics, or related field

Tools

Cloud storage
Data visualization systems
Job description

Parallel Domain is building the world’s most advanced simulation and digital twin platform for autonomy, robotics, and computer vision. Our Replica product creates large-scale, photorealistic digital twins of real-world environments used for testing, validation, and development of autonomous systems.

About the role:
  • We are hiring a Machine Learning Data Engineer responsible for building and scaling the data pipelines that support Replica and ML model development. You will ensure that data flows efficiently from raw customer inputs through validated, structured formats suitable for training, evaluation, and production systems.
What you'll do:
  • Own data ingestion: Build reliable pipelines to normalize and validate customer and synthetic data.
  • Define data standards: Create schemas, validation checks, and quality metrics for Replica datasets.
  • Build curation tooling: Implement tools for dataset filtering, versioning, and annotation support.
  • Enable ML workflows: Generate high-quality data feeds for training and evaluation across ML models.
What you’ll bring:
  • Data engineering experience: Proven experience building scalable data pipelines and tooling.
  • ML-aware engineering: Understanding of how data is used in model training and evaluation.
  • 3D Foundations: Practical experience with 3D concepts, geometry, and the linear algebra principles underpinning computer vision (e.g., projections, transformations)
  • Technical skills: Strong Python proficiency and comfort with large datasets.
  • Collaborative mindset: Experience working closely with ML engineers on data needs.
What will help you stand out:
  • Advanced degree: MS or PhD in ML, computer vision, robotics, or related field.
  • Cloud/infra experience: Familiarity with cloud storage and distributed processing frameworks.
  • Robotics data knowledge: Experience handling camera, lidar, or radar data
  • Visualization tools experience: Familiarity with data visualization systems like Foxglove, Rerun, or Voxel51
  • MLOps tooling exposure: Experience with dataset versioning, preprocessing automation, or training pipeline orchestration.
What we offer:
  • Competitive compensation: A base pay range of $130,000 - $160,000/yr, depending on your skills, qualifications, experience, and location.
  • Impactful work: The chance to contribute to the advancement of autonomous systems and AI.
  • Collaborative culture: A dynamic and supportive work environment where your ideas are valued.
  • Professional growth: Opportunities to learn and develop your skills in a cutting-edge field.

If you're passionate about machine learning, 3D reconstruction, generative AI, and the future of autonomous systems, we'd love to hear from you. Apply today and help us revolutionize the world of AI!

This position is available in Vancouver, BC and Karlsruhe DE.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.