Senior LLMOps Engineer — Cloud AI Inference + Equity

TEEMA Solutions Group

Toronto

Hybrid

CAD 120,000 - 160,000

Full time

30+ days ago

Job summary

A rapid-growth technology firm in Toronto is seeking a Staff LLMOps Engineer to lead the design and optimization of large language model infrastructure in the cloud. The ideal candidate has more than 6 years of DevOps experience and expertise in deploying LLMs in cloud environments. Responsibilities include architecting deployment pipelines and ensuring high-performance AI applications. The offer includes a competitive salary and equity.

Benefits

Competitive salary
Meaningful equity
Innovative work culture

Qualifications

  • 6+ years in DevOps or cloud platform engineering.
  • 2+ years of experience deploying LLMs.
  • Expertise in GPU-accelerated inference.

Responsibilities

  • Architect and operationalize LLM deployment pipelines on AWS.
  • Build and scale multi-GPU inference infrastructure.
  • Optimize inference performance using various frameworks.

Skills

DevOps expertise
ML infrastructure knowledge
Cloud platform experience
Python proficiency
Monitoring tools integration

Tools

AWS
Kubernetes
Terraform
Prometheus
Grafana