Enable job alerts via email!

AI Engineer

E.O.N Worldwide

Greater London

Hybrid

GBP 100,000 - 125,000

Full time

Yesterday

Be an early applicant

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A leading energy company in the United Kingdom seeks an AI Engineer to enhance their Generative AI Platform. This hybrid role involves designing and implementing crucial services that support internal AI product squads. Key responsibilities include building a reliable Model Serving Layer, engineering infrastructure, and implementing cost tracking mechanisms. Ideal candidates should have at least 2 years of relevant experience and be proficient in Python with knowledge of cloud services. Competitive salary and excellent benefits package included.

Benefits

26 days annual leave plus bank holidays

Generous pension

Life cover

Bonus opportunities

Accessible & supportive workplace

Qualifications

Minimum 2 years experience in Software Engineering, ML Engineering, or Data Engineering.
Extensive hands-on experience with cloud infrastructure and containerization.
Proven experience designing RAG pipelines using vector databases.

Responsibilities

Design, build, and maintain the Model Serving Layer for low-latency access.
Engineer and maintain production-ready Vector Database and RAG infrastructure.
Implement cost tracking and monitoring mechanisms.

Skills

Proficient in Python

Experience with cloud infrastructure (AWS, GCP, Azure)

Familiarity with LLM architectures

Knowledge of agentic frameworks

Understanding of MLOps tooling

Tools

Docker

Terraform

Kubernetes

We are maintaining and enhancing our Generative AI Platform that enables our entire organisation to leverage Generative AI safely, efficiently and at scale. As a

AI Engineer, you will be the force multiplier, moving beyond building end-user applications to developing robust, centralised services that power all internal GenAI product squads.

You will be instrumental in designing and implementing our award winning Generative AI Platform Model Orchestration Layer, RAG infrastructure, communications layer, agentic layer and centralised governance/safety guardrails etc.

This is a hybrid role, typically working 1 day per week at our Citigen office in London.

Here’s a taste of what you’ll be doing

Design, build, and maintain the highly reliable Model Serving Layer (Gateways, load balancing, caching, throttling) for low-latency access to LLMs and other generative models.
Engineer and maintain production-ready Vector Database and Retrieval-Augmented Generation (RAG) infrastructure, including high-throughput indexing pipelines and efficient retrieval strategies for enterprise data.
Develop and manage a standardised, secure Agent Framework/SDK (e.g., based on LangGraph or a custom library) that promotes consistency and maintainability across all agents built from the platform.
Implement robust mechanisms for cost tracking, monitoring, and precise cost allocation/chargebacks per squad, alongside implementing rate-limiting and budget controls.
Systematically benchmark, curate, and integrate the most cost-effective and performant models for distinct use cases.
Build and maintain scalable, secure infrastructure and data pipelines for custom model fine-tuning requested by product teams.
Architect and deploy platform-level safety filters (e.g., toxicity, PII masking, jailbreak prevention) that are enforced across all model inputs and outputs
Build and maintain the centralised logging, monitoring, and observability stack to track crucial GenAI metrics (hallucination rate, latency, token usage, and drift detection).
Ensure the platform and RAG data handling adheres to strict compliance and data privacy regulations (e.g., GDPR) through automated enforcement and auditing.
Drive the quality of the developer experience by writing exceptional SDKs, clear APIs, comprehensive documentation, and contributing to internal knowledge sharing (e.g. through workshops).

Are we the perfect match?

Minimum 2 years of experience in Software Engineering, ML Engineering, Data Science, or Data Engineering with a (bonus) specialised focus on building Generative AI products or agent-based systems.
Proficiency in Python and developing highly concurrent, scalable API services and microservices in a cloud environment.
Extensive, hands‑on experience with cloud infrastructure (AWS, GCP, or Azure), Infrastructure as Code (Terraform), Docker - containerisation. An understanding of kubernetes.
Understanding of LLM architectures, inference serving optimization techniques (quantisation, caching), and MLOps tooling (e.g., MLFlow, Kubeflow).
Proven experience designing, implementing, and optimising complex RAG pipelines using vector databases (e.g., Pinecone, PGVector).
Practical experience with agentic frameworks (e.g., LangChain, LlamaIndex, Langgraph, or custom implementation) and defining tool‑use patterns.
Demonstrated experience designing complex, multi‑tenant platform systems, prioritising reliability, security, and separation of concerns.
Proven ability to engineer for operational efficiency, with a track record of implementing cost‑saving measures in large‑scale cloud deployments.
A strong commitment to improving Developer Experience (DX) and acting as a technical consultant for internal product teams.

Here’s what you need to know

Role may close earlier due to high applications.
Competitive salary.
Location - Citigen, 47-53 Charterhouse Street, London, EC1M 6PB
Excellent parental leave allowance.
Award‑Winning Workplace – We’re proud to be named a Sunday Times Best Place to Work 2025 and the Best Place to Work for 16‑34‑year‑olds.
Outstanding Benefits – Enjoy 26 days of annual leave plus bank holidays, a generous pension, life cover, bonus opportunities and access to 20 flexible benefits with tax/NI savings.
Flexible & Family‑Friendly – Our industry‑leading hybrid and family‑friendly policies earned us double recognition at the Personnel Today Awards 2024. We’re open to discussing how flexibility can work for you.
Inclusive & Diverse – We’re the only energy company in the Inclusive Top 50 UK Employers. We’re also proud winners of Best Employer for Women and Human Company of the Year‑recognising our inclusive, people‑first culture.
Support at Every Stage of Life – We’re Fertility Friendly and Menopause Friendly accredited, with inclusive support for everyone.
Accessible & Supportive – Do you consider yourself as having a disability? As a Disability Confident Employer, we guarantee interviews for disabled applicants who meet the minimum criteria for the role and will make any adjustments needed during the process.
Invested in Your Growth – From inclusive talent networks to top‑tier development programmes, we’ll support your growth every step of the way.
For all successful candidates — due to the nature of this role your employment will be subject to a basic DBS (Disclosure Barring Service) check being carried out by ourselves via a 3rd party service provider.

Get your free, confidential resume review.

or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.