LLM Architect

Bright Purple

City of Edinburgh

On-site

GBP 100,000 - 120,000

Full time

13 days ago

Job summary

A renowned R&D technology company in Edinburgh is looking for an LLM Architect to design and optimize cloud-native architectures that run large language models. The ideal candidate will have expertise in large-scale ML systems and programming skills in C++/Rust/Go. This role offers exceptional benefits and the chance to tackle significant challenges in distributed AI systems, contributing to the future of AI infrastructure.

Benefits

Exceptional benefits

Diversity and inclusion initiatives

Qualifications

Deep understanding of large-scale ML systems engineering, with experience in deploying or optimising LLMs.
Hands-on expertise in C++/Rust/Go for systems programming, plus Python for model integration.
Strong knowledge of distributed runtimes and scheduling frameworks.
Experience with GPU cluster management and performance tuning across accelerators.
Solid grasp of cloud-native orchestration and observability tooling.

Responsibilities

Designing cloud-native architectures on serverless frameworks.
Developing approaches to reduce cold-start latency.
Building distributed inference pipelines for LLMs.
Experimenting with quantisation and pruning for maximum throughput.
Working closely with applied researchers to create robust production systems.

Skills

Large-scale ML systems engineering

C++/Rust/Go programming

Distributed runtimes and scheduling frameworks

GPU cluster management

Cloud-native orchestration

Observability tooling

Technical depth in AI

Tools

Kubernetes

Docker

Prometheus

Grafana

Jaeger

LLM Architect
Edinburgh (on-site)
£100k-120k + exceptional benefits

A rare chance to drive the future of AI infrastructure at one of the world's leading R&D tech organisations.

This is a senior opportunity with a global research leader, where you’ll architect and optimise the platforms that deliver large-scale language models to production. You’ll be working on some of the hardest challenges in distributed AI systems: building ultra-reliable, ultra-scalable environments for inference and deployment.

What you’ll be doing

Designing cloud-native architectures to run large language models on serverless frameworks (e.g. Kubernetes, Knative, or custom-built FaaS).

Developing approaches to minimise cold-start latency through advanced container snapshotting, weight pre-loading, and graph partitioning.

Building distributed inference pipelines with tensor parallelism, model sharding, and efficient memory scheduling to serve LLMs at scale.

Experimenting with quantisation, pruning, and KV-cache management to squeeze maximum throughput from GPU/accelerator clusters.

Working closely with applied researchers to turn state-of-the-art methods into robust, production-grade systems.

What you’ll bring

Deep understanding of large-scale ML systems engineering, with direct experience in deploying or optimising LLMs.

Hands-on expertise in C++/Rust/Go for systems programming, plus Python for model integration.

Strong knowledge of distributed runtimes and scheduling frameworks (e.g. Ray, Dask, MPI, or custom equivalents).

Experience with GPU cluster management (CUDA, NCCL, Triton Inference Server) and performance tuning across accelerators.

Solid grasp of cloud-native orchestration (Docker, Kubernetes, Helm) and observability tooling (Prometheus, Grafana, Jaeger).

Proven ability to translate cutting-edge research into engineered solutions that can scale globally.

Why this role stands out

Influence how next-generation LLM services are built and delivered to millions of users worldwide.

Operate at the intersection of distributed systems, high-performance computing, and AI research.

Join a global R&D organisation with unmatched resources, where innovation isn’t just encouraged — it’s expected.

This role is designed for an engineer who thrives on technical depth, large-scale challenges, and building systems that change what’s possible in AI.

Apply now through Bright Purple to take on one of the most impactful engineering roles available in Europe today.

Bright Purple is proud to be an equal opportunities employer. We partner with clients who value and actively promote diversity and inclusion across the technology sector.
