Enable job alerts via email!

Senior LLMOps Engineer — Cloud AI Inference + Equity

TEEMA Solutions Group

Toronto

Hybrid

CAD 120,000 - 160,000

Full time

30+ days ago

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A rapid-growth technology firm in Toronto is seeking a Staff LLMOps Engineer to lead the design and optimization of large language model infrastructure on the cloud. The ideal candidate has over 6 years of experience in DevOps and expertise in deploying LLMs in cloud environments. Responsibilities include architecting deployment pipelines and ensuring high-performance AI applications. Competitive salary and equity are included in the offer.

Benefits

Competitive salary

Meaningful equity

Innovative work culture

Qualifications

6+ years in DevOps or cloud platform engineering.
2+ years of experience with LLMs deployment.
Expertise with GPU-accelerated inference.

Responsibilities

Architect and operationalize LLM deployment pipelines on AWS.
Build and scale multi-GPU inference infrastructure.
Optimize inference performance using various frameworks.

Skills

DevOps expertise

ML infrastructure knowledge

Cloud platform experience

Python proficiency

Monitoring tools integration

Tools

AWS

Kubernetes

Terraform

Prometheus

Grafana

Get your free, confidential resume review.

or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.