Enable job alerts via email!

Staff Machine Learning Performance Engineer, Inference Optimisation

Wayve

City Of London

Hybrid

GBP 80,000 - 100,000

Full time

Today

Be an early applicant

Job summary

A pioneering AI technology company in London is looking for a Staff Machine Learning Performance Engineer to lead projects aimed at optimising machine learning inference for edge devices. This role involves collaborating with teams to identify improvements, develop technical roadmaps, and work with advanced AI models. Ideal candidates possess strong experience in optimisation and programming, along with excellent communication skills. The position offers a hybrid working model that promotes innovation and teamwork.

Qualifications

Experience solving optimisation problems with latency constraints.
Experience leading technical teams of 5+ people.
Excellent interpersonal and communication skills.
Experience in ML development is valuable, but not required.

Responsibilities

Identify opportunities for improvement in the ML compiler and/or kernels.
Develop with multiple target platforms in mind.
Build technical roadmaps and execute against them.
Collaborate closely with model developers and software engineers.

Skills

Optimisation problem-solving

MLIR

TensorRT

Cuda

OpenCL

Triton

Interpersonal skills

Python

C++

Tools

Nvidia SoCs

Qualcomm SoCs

Staff Machine Learning Performance Engineer, Inference Optimisation

London

At Wayve we're committed to creating a diverse, fair and respectful culture that is inclusive of everyone based on their unique skills and perspectives, and regardless of sex, race, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domestic or civil partnership status, sexual orientation, gender identity, veteran status, pregnancy or related condition (including breastfeeding) or any other basis as protected by applicable law.

About us

Founded in 2017, Wayve is the leading developer of Embodied AI technology. Our advanced AI software and foundation models enable vehicles to perceive, understand, and navigate any complex environment, enhancing the usability and safety of automated driving systems.

Our vision is to create autonomy that propels the world forward. Our intelligent, mapless, and hardware-agnostic AI products are designed for automakers, accelerating the transition from assisted to automated driving. In our fast‑paced environment, big problems ignite us—we embrace uncertainty, leaning into complex challenges to unlock groundbreaking solutions. We aim high and stay humble in our pursuit of excellence, constantly learning and evolving as we pave the way for a smarter, safer future.

At Wayve, your contributions matter. We value diversity, embrace new perspectives, and foster an inclusive work environment; we back each other to deliver impact.

Make Wayve the experience that defines your career!

The role

As a Staff/Principal ML Performance Engineer, you’ll lead high‑impact projects optimising ML inference for edge accelerators and GPUs. The focus of this team is to run large transformer‑based models efficiently in low‑cost, low‑power edge devices to enable Wayve’s first driving product. This is an exciting opportunity to lead in several high‑impact, early‑stage projects at Wayve, operating at the intersection of ML Compilers, Kernels, and ML engineering.

Key responsibilities:

You’ll identify opportunities for improvement in the ML compiler and/or kernels and implement
Develop with multiple target platforms in mind e.g. Nvidia (Thor, Orin), Qualcomm, etc
You’ll build technical roadmaps and work with teams to execute against them
You’ll collaborate closely with model developers and software engineers in other teams across the business
You’ll have the opportunity to develop new skills and experience

About you

Experience solving optimisation problems (e.g. developing systems with latency or other resource constraints)
Experience with any of (or similar): MLIR, TensorRT, Cuda, Qualcomm QNN, Cuda, OpenCL, Triton
Experience leading technical teams (5+ people)
Excellent interpersonal and communication skills
Experience with Nvidia and Qualcomm SoCs and frameworks are valuable, but not required
Experience in ML development is valuable, but not required
Proficiency with Python/C++

This is a full‑time role based in our office in London. At Wayve we want the best of all worlds so we operate a hybrid working policy that combines time together in our offices and workshops to fuel innovation, culture, relationships and learning, and time spent working from home. We operate core working hours so you can determine the schedule that works best for you and your team.

We understand that everyone has a unique set of skills and experiences and that not everyone will meet all of the requirements listed above. If you’re passionate about self‑driving cars and think you have what it takes to make a positive impact on the world, we encourage you to apply.

DISCLAIMER: We will not ask about marriage or pregnancy, care responsibilities or disabilities in any of our job adverts or interviews. However, we do look to capture information about care responsibilities, and disabilities among other diversity information as part of an optional DEI Monitoring form to help us identify areas of improvement in our hiring process and ensure that the process is inclusive and non‑discriminatory.

Get your free, confidential resume review.

or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.