Machine Learning Platform Engineer

LUCID

Canada

Remote

CAD 80,000 - 120,000

Full time

2 days ago

Job summary

LUCID Therapeutics is seeking a Machine Learning Platform Engineer to build a robust AI platform to improve health and wellness. This hands-on role focuses on establishing infrastructure for ML model lifecycle management, enhancing consumer therapeutic experiences through innovative AI solutions.

Qualifications

  • 5+ years of software experience writing production-grade Python code.
  • 2+ years building ML platforms on Kubernetes.
  • Experience with CI/CD tools.

Responsibilities

  • Set up a cloud-native ML control plane (Kubeflow) via Helm/Terraform.
  • Implement CI/CD/CT pipelines and deployment protocols.
  • Build observability dashboards for ML monitoring.

Skills

Python
Kubernetes
MLflow
GitOps
Collaboration

Job description

LUCID Therapeutics | Remote (Canada-based Preferred)

Want to apply your MLOps skills where they can make a deep impact on consumer health and wellness? LUCID is maturing its MLOps practices and is looking for a talented Machine Learning Platform Engineer to build a robust AI platform. The platform will abstract away infrastructure, robustness, and scalability concerns and streamline the machine learning model lifecycle, so the ML team spends less time on undifferentiated heavy lifting and more time on what matters: improving the capabilities of our models and, through them, the therapeutic value we provide to our consumers. As an ML Platform Engineer, you will lay the foundation for the platform that serves as the core delivery mechanism for LUCID’s AI offerings, improving the health and fitness music therapies we provide to our customers.

This is a hands-on role for someone who wants to build a robust and production-ready ML platform from the ground up.

What You’ll Do

We are looking for a talented Machine Learning Platform Engineer to join us in building out a robust AI and Data Platform, from initial infrastructure to final product, focusing on supporting the infrastructure behind the entire Machine Learning model lifecycle: Experimentation, Training, Deployment, Serving, Testing, and Analysis.

Your first 60 days will focus on standing up Kubeflow, KServe, and MLflow, followed by drift detection, retraining, and multi-region resilience.

  • Platform Infrastructure: Set up a cloud-native ML control plane (Kubeflow) via Helm/Terraform to serve as the foundation of our ML Platform.
  • CI/CD/CT Pipelines: Implement unit/integration tests, container builds, artifacts, and progressive delivery.
  • Model Registries: Set up model registries and version tracking.
  • Model Deployment and Serving: Set up deployment protocols (canary, blue-green, A/B testing), workflows, and inference endpoints (KServe).
  • Scalability: Configure CPU/GPU autoscaling, node-pool tuning, and resource quotas; collaborate with DevOps to set up FinOps guardrails.
  • Performance Analysis: Set up performance analysis to help the ML team monitor model performance in production, and assess for model drift.
  • Retraining: Stand up retraining, hyperparameter sweeps, and continuous-training pipelines for the ML team.
  • Dashboarding: Build out observability dashboards for the ML team to monitor training progress and performance.
  • Data: Work with a Data Platform Engineer to deliver feature stores, model metadata stores, and data pipelines.

What You Bring

  • 5+ years of software experience writing production-grade Python code.
  • 2+ years building ML Platforms on Kubernetes.
  • Platform Infrastructure: You know how to set up a Kubeflow environment, and its major components (Notebooks, Pipelines, Trainer, Katib).
  • Operations: Hands-on experience with Kubernetes, Helm, and Terraform (or similar IaC).
  • Model Lifecycle Knowledge: You understand the ML model lifecycle and the platform tools to improve it, including experiment tracking (MLflow), model registry, drift monitoring, and feature engineering pipelines.
  • Drift Analysis/AutoML: You know how to use model drift monitoring tools (Evidently AI) to trigger automated retraining.
  • GitOps: Experience with GitHub Actions or similar GitOps tooling.
  • Agility: Ability to work in a fast-paced, early-stage environment—delivering MVPs, iterating rapidly, and scaling systems as product-market fit emerges.
  • Collaboration: Exceptional communication and teamwork skills.
  • Startup Mindset: Resourceful, experimental, and resilient in the face of ambiguity and evolving requirements.
  • (Bonus) You’ve designed API interfaces (Flask, FastAPI).
  • (Bonus) You have experience building observability dashboards (Prometheus, Grafana).

Nice-to-have:

  • You’ve worked with event-driven architectures (RabbitMQ, Kafka).
  • You know how to set up Ray clusters for scalability.

Logistics

  • Full-time. Remote-first (Canada-based preferred).
  • Contract (6-12 months).
  • Reports to: Data and AI Platform Lead; Close collaboration with ML team.
  • Start: Targeting early July 2025.

About LUCID

LUCID Therapeutics is pioneering a new class of mobile health experiences—fusing AI, neuroscience, and sound to unlock human potential. Join us at the inception and help invent what comes next.

Seniority level

  • Mid-Senior level

Employment type

  • Contract

Job function

  • Engineering and Information Technology
