Enable job alerts via email!

Machine Learning Deployment Engineer

Torc Robotics

Montreal

On-site

CAD 80,000 - 110,000

Full time

30+ days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

Join a pioneering company in autonomous vehicle technology as an ML Deployment Engineer. This role involves deploying and optimizing machine learning models for real-time performance on edge devices, collaborating with cross-functional teams to enhance model efficiency. You'll have the chance to mentor junior engineers and implement cutting-edge techniques in a production-focused environment. If you're passionate about leveraging AI to transform the future of transportation, this is the perfect opportunity to make a significant impact in the autonomous vehicle industry.

Qualifications

  • 2+ years of experience in deploying ML models; proficiency in Python and C++.
  • Experience with deployment frameworks and containerization tools.

Responsibilities

  • Deploy and optimize ML models for production environments.
  • Design end-to-end inference pipelines for embedded systems.

Skills

Python
C++
Machine Learning
MLOps
Computer Vision

Education

Bachelor's degree in Computer Science
Master's degree in related field

Tools

Docker
Kubernetes
AWS
Azure
GCP
ONNX
TensorFlow
PyTorch

Job description

About the Company

At Torc, we have always believed that autonomous vehicle technology will transform how we travel, move freight, and do business.

A leader in autonomous driving since 2007, Torc has spent over a decade commercializing our solutions with experienced partners. Now a part of the Daimler family, we are focused solely on developing software for automated trucks to transform how the world moves freight.

Join us and catapult your career with the company that helped pioneer autonomous technology, and the first AV software company with the vision to partner directly with a truck manufacturer.

Job Description Summary

The model development department is looking for an ML deployment engineer who will deploy our next generation machine learning models for our autonomous driving stack.

As a senior engineer of the team, you will apply machine learning science in a production-focused environment. You will use machine learning models in both a unimodal and multimodal context, to solve all tasks across the functional autonomous driving stack. Training, validation, data science, and architectural design will be your daily work. You will be interested in understanding how your model performs in deployment, for which you will collaborate closely with deployment-focused teams. You will mentor and guide more junior members of the team and will always be interested in the newest trends in research, eager to translate scientific improvements into our production-grade machine learning pipelines.

Meet the team

Torc's Autonomy Applications software utilizes cutting-edge deep learning techniques to perceive the vehicle's environment, predict the movements of other vehicles, and execute accurate driving decisions. We are actively seeking an experienced ML deployment engineer to join our model development department. This is an exceptional opportunity for you to have a significant impact on the future of the autonomous vehicle industry by leveraging AI.

What You’ll Do:

  1. Model Deployment & Optimization
    Deploy and optimize machine learning models for production environments, ensuring real-time performance and resource efficiency on edge devices and automotive-grade hardware.
    Implement model quantization, pruning, and compression techniques to enhance inference speed while maintaining accuracy.
    Collaborate with ML engineers to transition research-grade code (e.g., PyTorch) into production-ready, scalable systems.
  2. Inference Pipeline Development
    Design and optimize end-to-end inference pipelines for embedded systems, leveraging frameworks like ONNX, TensorFlow Serving, or PyTorch Serve.
    Integrate model outputs with upstream & downstream systems (e.g., perception, control modules) via APIs or middleware.
  3. Cross-Functional Collaboration
    Partner with DevOps teams to build CI/CD pipelines for automated model deployment, testing, and rollback.
    Work with hardware engineers to profile and optimize model performance on target devices (e.g., NVIDIA Jetson).
  4. Monitoring & Maintenance
    Develop tools and dashboards to monitor model performance, data drift, and system health in production.
    Implement A/B testing and canary deployment strategies to validate model updates.
  5. Infrastructure & Tools
    Optimize data pipelines for low-latency inference, including preprocessing and postprocessing workflows.
    Advocate for MLOps best practices (versioning, reproducibility, logging) across the ML lifecycle.

What You’ll Need to Succeed:

  1. Education & Experience
    Bachelor’s degree in computer science, engineering, or related field with 2+ years of experience in deploying ML models (or master’s with 1+ years).
    Proven expertise in deploying models to edge devices or cloud platforms (AWS, Azure, GCP).
  2. Technical Skills
    Mastery of Python and C++; familiarity with CUDA, TensorRT, or OpenVINO for acceleration.
    Experience with deployment frameworks (e.g., ONNX, TensorFlow Lite, PyTorch Mobile) and containerization (Docker, Kubernetes).
    Knowledge of performance profiling tools (e.g., NVIDIA Nsight, VTune) and optimization techniques (e.g., layer fusion, memory management).
  3. Domain Knowledge
    Understanding of ML model lifecycle challenges (e.g., drift, scalability) and MLOps principles.
    Familiarity with computer vision, LiDAR/radar data, or sensor fusion workflows is a plus.
  4. Bonus Points!
    Experience with NVIDIA libraries (CUDA, CuDNN, TensorRT) or embedded SDKs (JetPack, DeepStream).
    Proficiency in distributed inference using Ray or Horovod.
    Cloud certifications (AWS ML Specialty, Azure AI Engineer) or MLOps tools (MLflow, Kubeflow).
    Knowledge of security practices for ML systems (e.g., adversarial defense, encrypted inference).

At Torc, we’re committed to building a diverse and inclusive workplace. We celebrate the uniqueness of our Torc’rs and do not discriminate based on race, religion, color, national origin, gender (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity, gender expression, age, veteran status, or disabilities.

Even if you don’t meet 100% of the qualifications listed for this opportunity, we encourage you to apply.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Machine Learning Deployment Engineer Montreal, Canada

Torc Robotics, Inc.

Montreal

On-site

CAD 80,000 - 120,000

30+ days ago

Machine Learning Deployment Engineer

Torc Robotics

Montreal

On-site

CAD 80,000 - 110,000

30+ days ago

Machine Learning Deployment Developer

Ubisoft

Montreal

On-site

CAD 70,000 - 110,000

30+ days ago