Enable job alerts via email!

Backend Engineer, AI / ML

Klue

Vancouver, Toronto

On-site

CAD 145,000 - 180,000

Full time

30+ days ago

Job summary

A tech company in Vancouver is seeking a Backend Engineer with a focus on AI/ML. You will design and build retrieval infrastructure, optimizing it for real-time user-facing systems. The ideal candidate will have strong backend programming skills and experience in search and retrieval systems. Competitive salary range is CA$145,000 - CA$180,000, with hybrid working options.

Benefits

Extended health & dental benefits

Employee Stock Option Plan

Flexible PTO policy

Qualifications

3+ years of backend engineering experience, ideally in search and retrieval systems.
Strong programming skills in Python, Ruby, or similar backend languages.
Experience with search infrastructure and vector search systems.

Responsibilities

Design, implement, and maintain retrieval infrastructure and APIs.
Integrate dense retrieval and hybrid retrieval models into live systems.
Optimize latency and throughput of retrieval systems.

Skills

Backend engineering experience

Strong programming skills in Python or Ruby

Experience with search infrastructure

Understanding of retrieval pipelines

Familiarity with real-time data pipelines

Experience with distributed systems

Familiarity with cloud infrastructure

Strong debugging and profiling skills

Tools

Elasticsearch

Docker

Kubernetes

GCP

Kafka

Overview

Klue is hiring a Backend Engineer, AI / ML to join our team in Toronto to work on LLM-powered search and retrieval agents. In this role, you will build and optimize the retrieval infrastructure, APIs, and data pipelines that power our ML team’s agentic workflows, enabling fast, accurate, and scalable retrieval for advanced user-facing systems.

What you’ll do on a day to day basis

Design, implement, and maintain retrieval infrastructure and APIs that interface seamlessly with LLM-based agent workflows.
Integrate dense retrieval, hybrid retrieval, and re-ranking models into live systems.
Optimize latency, scalability, and throughput of retrieval systems for real-time agentic pipelines.
Build and maintain vector database infrastructure (FAISS, Milvus, Weaviate, Pinecone, PGVector) and traditional search engines (Elasticsearch, OpenSearch).
Support retrieval-augmented generation (RAG) workflows, including efficient query execution, chunk retrieval, and caching strategies.
Develop monitoring and observability tools for retrieval pipelines to ensure reliability and transparency.
Work with ML engineers on data pipelines for indexing and re-indexing, enabling continuous improvement of search relevance.
Contribute to the architecture of multi-step retrieval agents, ensuring clean abstractions between the backend and ML layers.

What experience we’re looking for

3+ years of backend engineering experience, ideally in search, retrieval systems, or high-scale APIs.
Strong programming skills in Python, Ruby, or similar backend languages.
Experience with search infrastructure (Elasticsearch, OpenSearch, Vespa) and vector search systems.
Understanding of retrieval pipelines, dense retrieval, and hybrid search.
Familiarity with real-time data pipelines (Kafka, Pub / Sub) for indexing workflows.
Experience with distributed systems and microservices, with a focus on reliability and performance.
Familiarity with cloud infrastructure (AWS, GCP, Azure) and container orchestration (Kubernetes).
Ability to work collaboratively with ML Engineers, understanding their experimentation workflows and constraints.
Strong debugging and profiling skills for production systems.

Nice to Have

Experience with retrieval-augmented generation (RAG) or agentic retrieval workflows.
Exposure to prompt engineering and LLM system integration.
Contributions to open-source projects in search or retrieval.

What makes you thrive at Klue?

Take ownership and run with ambiguous problems
Jump into new areas and rapidly learn what's needed to deliver solutions
Bring scientific rigor while maintaining a pragmatic delivery focus
See unclear requirements as an opportunity to shape the solution

Technologies we use

LLM platforms: OpenAI, Anthropic, open-source models
ML frameworks: PyTorch, Transformers, spaCy
Search / Vector DBs: Elasticsearch, Pinecone, PostgreSQL
MLOps tools: Weights & Biases, MLflow, Langfuse
Infrastructure: Docker, Kubernetes, GCP
Development: Python, Git, CI / CD

Working style

Hybrid. Best of both worlds (remote & in-office). You and your team will be in the office 2 days a week.
Our main Canadian hubs are in Vancouver and Toronto, and most of our teams are located in EST and PST.

Compensation & Benefits

Pay range: CA$145,000 - CA$180,000 per year
Benefits. Extended health & dental benefits that kick in Day 1
Options. Opportunity to participate in our Employee Stock Option Plan
Time off. Take what you need. The average Klue team member takes 2-4 weeks of PTO per year, with approval from your team in advance
Direct access to our leadership team, including our CEO

Equal Opportunity

At Klue, we’re dedicated to creating an inclusive, equitable and diverse workplace as an equal-opportunity employer. Our commitment is to build a high-performing team where people feel a strong sense of belonging, can be their authentic selves, and are able to reach their full potential. If there’s anything we can do to make our hiring process more accessible or to better support you, please let us know, we’re happy to accommodate.

Get your free, confidential resume review.

or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.