
Enable job alerts via email!
Generate a tailored resume in minutes
Land an interview and earn more. Learn more
A financial technology firm in London is seeking an AI Inference Engineer to develop APIs for AI inference used by internal and external customers. Responsibilities include benchmarking, improving system reliability, and exploring LLM optimizations. The ideal candidate has experience with ML systems, deep learning frameworks, and GPU architectures. Competitive salary and equity may be offered.
London
Full time
AI
We are looking for an AI Inference engineer to join our growing team. Our current stack is Python, Rust, C++, PyTorch, Triton, CUDA, Kubernetes. You will have the opportunity to work on large-scale deployment of machine learning models for real-time inference.
Final offer amounts are determined by multiple factors, including, experience and expertise.
Equity: In addition to the base salary, equity may be part of the total compensation package.