Enable job alerts via email!

Principal Data Scientist

JR United Kingdom

Bolton

On-site

GBP 60,000 - 80,000

Full time

23 days ago

Job summary

A leading technology company in Bolton is seeking a Principal Data Scientist to design and build Gen AI virtual agents supporting customers across multiple channels. You will lead the development of LLM-powered experiences, focusing on automation, orchestration, and performance measurement. Ideal candidates possess a relevant degree, strong skills in machine learning, and hands-on experience with AWS and Azure services.

Qualifications

  • Proven expertise in mathematics and classical ML algorithms.
  • Hands-on with AWS and Azure services for data/ML.
  • Strong engineering skills in Python, APIs, and containers.
  • Experience with Kubernetes and model serving infrastructure.
  • Strong communication and stakeholder management skills.

Responsibilities

  • Design and build GenAI/LLM virtual agents.
  • Enable orchestration and management of AI interactions.
  • Develop LLM-powered processing workflows.
  • Create testing suites for LLM performance metrics.
  • Automate the machine learning lifecycle from data ingestion to deployment.

Skills

Mathematics and classical ML algorithms
LLMs knowledge (prompting, fine-tuning, RAG)
AWS services
Azure services
Python programming
Containerization (Docker, Kubernetes)
Git and CI/CD
Infrastructure as Code (Terraform/CloudFormation)
Workflow Automation
Data Visualisation

Education

Relevant primary level degree
MSc or PhD

Tools

AWS SageMaker
Azure OpenAI
Kubernetes
GitHub Actions
Azure DevOps
Job description

Social network you want to login/join with:

Principal Data Scientist, bolton, greater manchester

col-narrow-left

Client:

ISx4

Location:

bolton, greater manchester, United Kingdom

Job Category:

Other

-

EU work permit required:

Yes

col-narrow-right

Job Views:

1

Posted:

26.08.2025

Expiry Date:

10.10.2025

col-wide

Job Description:

You will be part of a team designing and building a Gen AI virtual agent to support customers and employees across multiple channels. You will build and run LLM-powered agentic experiences, owning the design, orchestration, MLOps, and continuous improvement.

  • Design & build client-specific GenAI/LLM virtual agents
  • Enable the orchestration, management, and execution of AI-powered interactions through purpose-built AI agents
  • Design, build and maintain robust LLM powered processing workflows
  • Develop cutting edge testing suites related to bespoke LLM performance metrics
  • Develop bespoke testing suites and LLM performance metrics
  • CI/CD for ML/LLM: automated build/train/validate/deploy pipelines for chatbots and agent services
  • IaC - Infrastructure as Code, (Terraform/CloudFormation) to provision scalable cloud for training and real-time inference
  • Observability: monitoring, drift detection, hallucination, SLOs, and alerting for model and service health
  • Serving at scale: containerised, auto-scaling (e.g., Kubernetes) with low-latency inference
  • Data & model versioning; maintain a central model registry with lineage and rollback
  • Deliver a live performance dashboard (intent accuracy, latency, error rates) and a documented retraining strategy
  • Lead and foster creativity around frameworks/models; collaborate closely with product, engineering, and client stakeholders

Qualifications / Experience

  • Relevant primary level degree and ideally MSc or PhD
  • Proven expertise in mathematics and classical ML algorithms, plus deep knowledge of LLMs (prompting, fine-tuning, RAG/tool use, evaluation)
  • Hands-on with AWS and Azure services for data/ML (e.g., Bedrock/SageMaker, Azure OpenAI/Azure ML)
  • Strong engineering: Python, APIs, containers, Git; CI/CD (GitHub Actions/Azure DevOps), IaC (Terraform/CloudFormation)
  • Scalable Serving Infrastructure: A containerized, auto-scaling environment (e.g., using Kubernetes) to serve the chatbot model with low latency
  • Workflow Automation: Automate the end-to-end machine learning lifecycle, from data ingestion and preprocessing to model retraining and deployment
  • Live Performance Dashboard: A real-time dashboard displaying key model metrics such as intent accuracy, response latency, and error rates
  • Centralized Model Registry: A versioned repository for all trained models, their performance metrics, and associated training data
  • Documented Retraining Strategy: An automated workflow and documentation outlining the process for periodically retraining the model on new data
  • Experience with Kubernetes, inference optimisation, caching, vector stores, and model registries
  • Clear communication, stakeholder management, and a habit of writing crisp technical docs and runbooks

Personal Attributes

  • Personal Integrity, Stakeholder Management, Project Management, Agile Methodologies, Automation, Data Visualisation and Analysis.
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.