Enable job alerts via email!

Data Scientist- Gen AI

Scrumconnect Consulting

City Of London

Hybrid

GBP 70,000 - 90,000

Full time

3 days ago
Be an early applicant

Job summary

A leading consultancy firm in London is seeking an experienced Data Scientist specializing in Generative AI. The role involves designing and shipping AI-powered tools independently from concept to production. Candidates should have 7+ years of experience, strong skills in Python and SQL, and a proven ability to deliver GenAI products. Competitive compensation and flexible working conditions are offered.

Qualifications

  • 7+ years in Data Science/ML with hands-on delivery of GenAI products.
  • Proven ability to ship prototypes to production-ready tools.
  • Strong Python & SQL skills; solid software engineering habits.

Responsibilities

  • Build GenAI tools end-to-end independently.
  • Own evaluation & safety, creating offline/online eval sets.
  • Productionise and package as services/APIs.

Skills

Data Science
Generative AI
Python
SQL
Machine Learning

Tools

PyTorch
Docker
AWS
GitHub Actions
Streamlit
Job description
Overview

We are hiring a Data Scientist with strong Generative-AI experience to design, build, and ship AI-powered tools end-to-end. You will work in a small, multi-disciplinary team and take ownership from discovery to deployment: scoping use-cases, building prototypes, hardening them for production, and putting the right evaluation and governance around them.

What you’ll do
  • Build GenAI tools end-to-end (independently): chat/assistants, document Q&A (RAG), summarisation, classification, extraction, and workflow/agent automations.
  • Own evaluation & safety: create offline/online eval sets, measure faithfulness/hallucination, bias, safety, latency and cost; add guardrails and red-teaming.
  • Productionise: package as services/APIs or lightweight apps (e.g., Streamlit/Gradio/React), containerise, and integrate via CI/CD.
  • Data pipelines: design chunking/embedding strategies, pick vector stores, manage prompt/versioning, and monitor drift & quality.
  • Model strategy: select and mix providers (hosted and open-source), fine-tune where sensible, and optimise for cost/perf/privacy.
  • Stakeholder enablement: translate problems into measurable KPIs, run discovery, document clearly, and hand over maintainable solutions.
  • Good practice: apply data ethics, security and privacy by design; align to service standards and accessibility where relevant.
Tech you’ll likely use
  • Python (pandas, PyTorch/Transformers), SQL
  • LLM frameworks: LangChain, LlamaIndex (or similar)
  • Vector DBs: FAISS / pgvector / Pinecone (or similar)
  • Cloud & Dev: Azure/AWS/GCP, Docker, REST APIs, GitHub Actions/CI
  • Data & MLOps: BigQuery/Snowflake, MLflow/DVC, dbt/Airflow (nice to have)
  • Front ends (for internal tools): Streamlit / Gradio / basic React
Must-have experience
  • 7+ years in Data Science/ML, including hands-on delivery of GenAI products (not just PoCs).
  • Proven ability to ship independently: from idea prototype to production-ready tool.
  • Strong Python & SQL; solid software engineering habits (testing, versioning, CI/CD).
  • Practical LLM skills: prompt design, RAG, tool/function calling, evaluation & guardrails, and prompt/model observability.
  • Sound grasp of statistics/experimentation (A/B tests, hypothesis testing) and communicating impact to non-technical audiences.
  • Data governance, privacy and secure handling of sensitive data.
Nice to have
  • Experience in regulated or public-sector-like environments.
  • Azure OpenAI / Vertex AI / Bedrock; lightweight fine-tuning/LoRA.
  • Front-end skills to craft usable internal UIs.

Please apply with your CV at LNKD1_UKTJ

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.