
Enable job alerts via email!
An innovative AI company based in London is seeking a Machine Learning Engineer to develop and optimize AI inference APIs for real-time applications. The ideal candidate has experience with ML systems and deep learning frameworks such as PyTorch and TensorFlow. Responsibilities include improving system reliability and exploring innovative techniques for LLM optimization. Competitive compensation is offered.
Perplexity is an AI-powered answer engine founded in December 2022 and growing rapidly as one of the world’s leading AI platforms. Our objective is to build accurate, trustworthy AI that powers decision-making for people and assistive AI wherever decisions are being made. Our current stack includes Python, Rust, C++, PyTorch, Triton, CUDA, and Kubernetes. You will have the opportunity to work on large-scale deployment of machine learning models for real-time inference.