Enable job alerts via email!

Senior / Staff Software Engineer (AI / Compiler)

JR United Kingdom

Portsmouth

On-site

GBP 145,000 - 180,000

Full time

7 days ago

Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

A leading company in AI technology is seeking a Senior / Staff Software Engineer to develop high-performance computing systems for their Optical Tensor Processing Units (OTPUs). The role emphasizes optimizing AI workloads and collaborating with a team on cutting-edge advancements in AI infrastructure and compiler technology. The ideal candidate will have substantial experience in HPC, strong programming skills, and a passion for driving innovation in a rapidly evolving field.

Benefits

Competitive salary from £145k+

Stock options in a growing AI company

Incentive for living within 20 minutes of office

Qualifications

5+ years of experience in building performance-critical systems
Strong debugging and profiling skills
Deep understanding of distributed systems with real-time processing focus

Responsibilities

Design and build high-performance systems for AI/ML workloads
Optimize systems for ultra-low latency and real-time inference
Collaborate with compiler engineers to enhance code execution

Skills

Distributed systems

Performance tuning

C++

Python

AI infrastructure

Education

Degree in Computer Science, Engineering, Mathematics, or related field

Tools

ML compilers (LLVM, MLIR)

ML frameworks (PyTorch, ONNX, OpenXLA)

Social network you want to login/join with:

Senior / Staff Software Engineer (AI / Compiler), Portsmouth, Hampshire

Company Overview

Flux is pioneering a new class of AI accelerators called Optical Tensor Processing Units (OTPUs). We’ve already developed functioning prototypes and are now scaling our operations in London. Our work environment rewards innovation, speed, and bold thinking.

The role

We’re hiring Senior and Staff Software Engineers to build the high-performance computing infrastructure that powers our OTPUs. This role involves rethinking how AI workloads are executed at speed and scale, focusing on designing and implementing software systems for distributed, low-latency inference across clusters.

Responsibilities

Design and build high-performance systems for running AI/ML workloads across distributed compute clusters
Optimize for ultra-low latency and real-time inference at scale—profiling, tuning, and rewriting critical systems as needed
Identify and resolve performance bottlenecks across the stack, from model execution and scheduling to hardware-level constraints
Collaborate with compiler engineers to improve code generation, execution paths, and memory layouts using tools like LLVM or MLIR
Work with hardware teams to ensure the software stack fully leverages the capabilities of our OTPU architecture
Extend ML frameworks (e.g., PyTorch, ONNX, OpenXLA) to support performance-critical inference paths
Lead design reviews, mentor engineers, and promote best practices in HPC and performance engineering
Stay updated with new developments in AI infrastructure, compute systems, and compiler tooling

Skills & Experience

5+ years of experience building performance-critical systems in HPC, HFT, large-scale simulation, or AI infrastructure
Deep understanding of distributed systems with a focus on real-time or near real-time data processing
Strong programming skills in C++ and Python for performance-sensitive applications
Hands-on experience with ML compilers (e.g., LLVM, MLIR), and knowledge of runtime and scheduling optimizations
Practical knowledge of ML frameworks like PyTorch, ONNX, or OpenXLA, and how to optimize their execution
Experience scaling AI workloads across clusters or custom infrastructure
Strong debugging, profiling, and performance-tuning skills across the stack
Degree in Computer Science, Engineering, Mathematics, or a related field

Details

Competitive salary from £145k+, depending on experience
Stock options in a rapidly growing AI company
Based in our new 5,000 sq. ft. office in the AI hub of Kings Cross, London
Flux hires candidates within a 45-minute commute of our office—offering an extra £24k/year incentive if you choose to live within 20 minutes

Get your free, confidential resume review.

or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs