Enable job alerts via email!
Boost your interview chances
Create a job specific, tailored resume for higher success rate.
Join a forward-thinking company as a Senior Machine Learning Engineer, where you'll work on cutting-edge LLM-powered agents that automate workflows. This role offers a unique opportunity to shape the future of insight generation systems while collaborating with a talented team. You'll dive deep into ML applications, optimizing retrieval systems, and developing scalable ML tools. With a hybrid working style, you can enjoy the flexibility of remote work while still engaging with your team in the office. If you're passionate about machine learning and eager to make a real impact, this is the perfect opportunity for you to thrive.
We're looking for a Senior Machine Learning Engineer to join our ML Foundation and Platform team in Toronto, focusing on building and optimizing state-of-the-art LLM-powered agents that can reason, plan and automate workflows for users. You'll be joining us at an exciting time as we reinvent our insight generation systems, making this an excellent opportunity for someone with strong ML and IR fundamentals who wants to dive deep into practical LLM applications.
As a member of our team, you'll be focusing on optimizing LLM-based agents, creating a platform for other teams to utilize ML capabilities and deploying ML services to production.
You'll measure and improve retrieval systems across the spectrum from BM25 to semantic search and develop comprehensive evaluation metrics to measure their performance. A key challenge will be developing optimal chunking and enrichment strategies for diverse data sources including news articles, website changes, documents, CRM entries, call recordings and internal communications. You'll explore how different data types and formats impact retrieval performance and develop strategies to maintain high relevance across all sources.
Beyond agents and retrieval, you'll work on building a platform for other teams to effectively utilize LLM tools and take advantage of prompt engineering. This includes developing APIs and scalable systems, developing scalable tools and services to handle machine learning training and inference for our clients, writing zero-shot and few-shot prompts with structured inputs/outputs, and implementing benchmarking systems for prompts.
You'll also work on training and fine-tuning smaller, more efficient models that can match the performance of LLMs at a fraction of the cost. This includes creating labeled datasets (sometimes using prompts), conducting careful hyperparameter optimizations, and building automated training pipelines. You'll also deploy and monitor these models in production, optimize their latency, and implement comprehensive offline/online metrics to track their performance.
Throughout all this work, you'll apply your deep understanding of the latest breakthroughs to build scalable, production-ready systems that turn cutting-edge ML experiments into reliable business value.
We're looking for builders who:
Not ticking every box? That’s okay. We take potential into consideration. An equivalent combination of education and experience may be accepted in lieu of the specifics listed above. If you know you have what it takes, even if that’s different from what we’ve described, be sure to explain why in your application.
At Klue, we're dedicated to creating an inclusive, equitable and diverse workplace as an equal-opportunity employer. Our commitment is to build a high-performing team where people feel a strong sense of belonging, can be their authentic selves, and are able to reach their full potential. If there’s anything we can do to make our hiring process more accessible or to better support you, please let us know, we’re happy to accommodate.