Social network you want to login/join with:
Senior / Staff Software Engineer (AI / Compiler), Portsmouth, Hampshire
Company Overview
Flux is pioneering a new class of AI accelerators called Optical Tensor Processing Units (OTPUs). We’ve already developed functioning prototypes and are now scaling our operations in London. Our work environment rewards innovation, speed, and bold thinking.
The role
We’re hiring Senior and Staff Software Engineers to build the high-performance computing infrastructure that powers our OTPUs. This role involves rethinking how AI workloads are executed at speed and scale, focusing on designing and implementing software systems for distributed, low-latency inference across clusters.
Responsibilities
- Design and build high-performance systems for running AI/ML workloads across distributed compute clusters
- Optimize for ultra-low latency and real-time inference at scale—profiling, tuning, and rewriting critical systems as needed
- Identify and resolve performance bottlenecks across the stack, from model execution and scheduling to hardware-level constraints
- Collaborate with compiler engineers to improve code generation, execution paths, and memory layouts using tools like LLVM or MLIR
- Work with hardware teams to ensure the software stack fully leverages the capabilities of our OTPU architecture
- Extend ML frameworks (e.g., PyTorch, ONNX, OpenXLA) to support performance-critical inference paths
- Lead design reviews, mentor engineers, and promote best practices in HPC and performance engineering
- Stay updated with new developments in AI infrastructure, compute systems, and compiler tooling
Skills & Experience
- 5+ years of experience building performance-critical systems in HPC, HFT, large-scale simulation, or AI infrastructure
- Deep understanding of distributed systems with a focus on real-time or near real-time data processing
- Strong programming skills in C++ and Python for performance-sensitive applications
- Hands-on experience with ML compilers (e.g., LLVM, MLIR), and knowledge of runtime and scheduling optimizations
- Practical knowledge of ML frameworks like PyTorch, ONNX, or OpenXLA, and how to optimize their execution
- Experience scaling AI workloads across clusters or custom infrastructure
- Strong debugging, profiling, and performance-tuning skills across the stack
- Degree in Computer Science, Engineering, Mathematics, or a related field
Details
- Competitive salary from £145k+, depending on experience
- Stock options in a rapidly growing AI company
- Based in our new 5,000 sq. ft. office in the AI hub of Kings Cross, London
- Flux hires candidates within a 45-minute commute of our office—offering an extra £24k/year incentive if you choose to live within 20 minutes