Enable job alerts via email!

Senior / Staff Software Engineer (AI / Compiler)

JR United Kingdom

Hounslow

On-site

GBP 145,000 - 180,000

Full time

7 days ago
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

Join a leading AI company in London as a Senior / Staff Software Engineer to work on pioneering Optical Tensor Processing Units. You'll lead the design of high-performance computing infrastructure while collaborating with ML and hardware teams, optimizing AI workloads for speed and efficiency. We offer a competitive salary, stock options, and an attractive incentive for local commuting.

Benefits

Stock options
Extra £24k/year incentive for local living

Qualifications

  • 5+ years in performance-critical systems.
  • Hands-on experience with ML compilers.
  • Deep understanding of real-time data processing.

Responsibilities

  • Design systems for AI workloads across clusters.
  • Optimize systems for ultra-low latency performance.
  • Collaborate with hardware teams to enhance OTPU capabilities.

Skills

C++
Python
Distributed Systems
Performance Tuning
Debugging
Machine Learning Frameworks
Optimizations

Education

Degree in Computer Science
Degree in Engineering
Degree in Mathematics

Tools

LLVM
MLIR
PyTorch
ONNX
OpenXLA

Job description

Social network you want to login/join with:

Senior / Staff Software Engineer (AI / Compiler), south west london

Company Overview

Flux is pioneering a new class of AI accelerators called Optical Tensor Processing Units (OTPUs). We’ve already developed functioning prototypes and are now scaling our operations in London. Our work environment rewards innovation, speed, and bold thinking.

The role

We’re hiring Senior and Staff Software Engineers to build the high-performance computing infrastructure that powers our Optical Tensor Processing Units (OTPUs). This isn’t just about scaling models—it’s about rethinking how AI workloads are executed at speed and scale.

You’ll lead the design and implementation of software systems that run distributed, low-latency inference across clusters. You’ll work closely with hardware and ML teams to optimise every layer of the stack—from model representation and execution to data movement and scheduling. Whether it’s through compiler techniques, systems-level tuning, or custom runtime design, you’ll play a critical role in shaping the performance layer of our AI platform. This is a role for engineers who think in microseconds, not just model accuracy. If you’ve worked in HFT, large-scale scientific compute, or AI infrastructure at serious scale, we’d love to talk.

Responsibilities

  • Design and build high-performance systems for running AI/ML workloads across distributed compute clusters
  • Optimise for ultra-low latency and real-time inference at scale—profiling, tuning, and rewriting critical systems as needed
  • Identify and resolve performance bottlenecks across the stack, from model execution and scheduling to hardware-level constraints
  • Collaborate with compiler engineers to improve code generation, execution paths, and memory layouts using tools like LLVM or MLIR
  • Work with hardware teams to ensure the software stack fully leverages the capabilities of our OTPU architecture
  • Extend ML frameworks (e.g. PyTorch, ONNX, OpenXLA) to better support performance-critical inference paths
  • Lead design reviews, mentor engineers, and promote best practices in HPC and performance engineering
  • Stay on the frontier of new developments in AI infrastructure, compute systems, and compiler tooling

Skills & Experience

  • 5+ years of experience building performance-critical systems in HPC, HFT, large-scale simulation, or AI infrastructure
  • Deep understanding of distributed systems, with a focus on real-time or near real-time data processing
  • Strong programming skills in C++ and Python, especially for performance-sensitive applications
  • Hands-on experience with ML compilers (e.g. LLVM, MLIR), and knowledge of runtime and scheduling optimisations
  • Practical knowledge of ML frameworks like PyTorch, ONNX, or OpenXLA, and how to optimise their execution
  • Experience scaling AI workloads across clusters or custom infrastructure—not just deploying on standard cloud setups
  • Strong debugging, profiling, and performance-tuning skills across the stack
  • Degree in Computer Science, Engineering, Mathematics, or a related field

Details

  • Competitive salary ranging from £145k+, depending on experience.
  • Stock options in a rapidly growing AI company.
  • Based in our new 5,000 sq. ft. office in the AI hub of Kings Cross, London.
  • Flux hires candidates within a 45-minute commute of our office—offering an extra £24k/year incentive if you choose to live within 20 minutes.
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Senior / Staff Software Engineer (AI / Compiler)

JR United Kingdom

Bedford

On-site

GBP 145,000 - 180,000

Today
Be an early applicant

Senior / Staff Software Engineer (AI / Compiler)

JR United Kingdom

Crawley

On-site

GBP 145,000 - 180,000

Today
Be an early applicant

Senior / Staff Software Engineer (AI / Compiler)

JR United Kingdom

Watford

On-site

GBP 145,000 - 180,000

6 days ago
Be an early applicant

Senior / Staff Software Engineer (AI / Compiler)

JR United Kingdom

Luton

On-site

GBP 145,000 - 180,000

6 days ago
Be an early applicant

Senior / Staff Software Engineer (AI / Compiler)

JR United Kingdom

Guildford

On-site

GBP 145,000 - 175,000

6 days ago
Be an early applicant

Senior / Staff Software Engineer (AI / Compiler)

JR United Kingdom

Maidstone

On-site

GBP 145,000 - 167,000

6 days ago
Be an early applicant

Senior / Staff Software Engineer (AI / Compiler)

JR United Kingdom

High Wycombe

On-site

GBP 145,000 - 167,000

6 days ago
Be an early applicant

Chemical Engineer - AI Trainer

ATTB - The Big Jobsite

London

Remote

GBP 125,000 - 150,000

2 days ago
Be an early applicant

Staff Full Stack Engineer - AI LegalTech Scale-Up

Burns Sheehan

City Of London

Remote

GBP 160,000 - 160,000

2 days ago
Be an early applicant