A technology recruiting firm in Staines-upon-Thames is seeking an experienced Data Scientist with expertise in Generative AI for a 3-month contract. The role involves end-to-end development of AI-powered tools, leading evaluation efforts, and managing data pipelines. Candidates should have a minimum of 7 years in Data Science and demonstrable experience delivering production-ready AI solutions. This position offers competitive compensation and a collaborative environment.
Qualifications
Minimum 7 years in Data Science/ML, delivering Generative AI products.
Ability to independently deliver production-ready tools.
Strong proficiency in Python and SQL.
Experience with LLMs and data governance compliance.
Responsabilités
Develop and deploy Generative AI tools.
Lead evaluation efforts and create test sets.
Design and manage data pipelines.
Ensure best practices in data ethics and privacy.
Connaissances
Generative AI
Python
SQL
Data Science
Machine Learning
Outils
PyTorch
Transformers
Azure
AWS
GCP
Docker
GitHub Actions
Streamlit
Gradio
Description du poste
Job Description
Role Overview - 3 month contract
We are seeking an experienced Data Scientist with a strong background in Generative AI to design, build, and deploy AI-powered tools end-to-end. You will work within a small, multi-disciplinary team and take full ownership of projects—from initial discovery through to production deployment. This includes scoping use cases, building prototypes, productionising solutions, and implementing robust evaluation and governance frameworks.
Key Responsibilities
Develop and deploy Generative AI tools independently, including chat assistants, document Q&A (RAG), summarisation, classification, extraction, and agent-based workflow automation.
Lead evaluation and safety efforts, including the creation of offline/online test sets, and measurement of faithfulness, hallucination, bias, latency, and cost. Implement guardrails and red-teaming strategies.
Package solutions as services, APIs, or lightweight applications (e.g., Streamlit, Gradio, React), and integrate them via CI/CD pipelines.
Design and manage data pipelines, including chunking and embedding strategies, vector store selection, prompt versioning, and monitoring for drift and quality.
Define model strategy, selecting and combining hosted and open-source providers, fine-tuning where appropriate, and optimising for performance, cost, and privacy.
Translate stakeholder requirements into measurable KPIs, lead discovery sessions, document solutions clearly, and ensure maintainability.
Apply best practices in data ethics, security, and privacy, and align solutions with service standards and accessibility requirements.
* Le salaire de référence se base sur les salaires cibles des leaders du marché dans leurs secteurs correspondants. Il vise à servir de guide pour aider les membres Premium à évaluer les postes vacants et contribuer aux négociations salariales. Le salaire de référence n’est pas fourni directement par l’entreprise et peut pourrait être beaucoup plus élevé ou plus bas.