Overview
Graduate Data Scientist - Digital Enterprise
Location: London, UK
Salary: Competitive graduate package with benefits
Role: Graduate Digital Enterprise Data Scientist
As a Graduate Data Scientist, you will code, develop, and deploy AI-powered applications, dashboards, and data pipelines that enhance io consulting's workflows and client delivery. You'll extract, analyse, and interpret data using algorithmic, statistical, and machine learning techniques, while gaining experience in cutting-edge tools such as large language models (LLMs), retrieval augmented generation (RAG) systems, and API integrations.
This is a hands-on coding role for an early-career hire - ideal for a graduate who can write AI code, build dashboards, and manage data pipelines, and who is keen to grow into a leading role as io expands its data science capability.
Responsibilities
- AI-Driven Application Development
- Develop, test, and deploy AI-powered applications using APIs and LLMs to address business and project challenges.
- Create prototypes and proof-of-concept applications to demonstrate the feasibility of AI solutions.
- Collaborate with consultants and engineers to integrate AI applications into existing systems and workflows.
- RAG-Driven Generative AI
- Support the design and implementation of retrieval augmented generation (RAG) pipelines, learning to select appropriate architectures for different use cases.
- Code and optimise vector indexing strategies and embedding frameworks to improve information retrieval.
- Build document processing pipelines to convert diverse file formats into searchable knowledge bases.
- Contribute to both static and dynamic RAG systems, ensuring high-quality responses and efficient resource use.
- Assist in developing hybrid RAG architectures combining static and dynamic approaches.
- Support the creation of evaluation frameworks to measure retrieval accuracy and response quality.
- Data Analysis and Insights
- Collaborate with internal teams to identify key business challenges and propose data-driven solutions.
- Design algorithms and experiments to analyse large datasets.
- Implement machine learning tools and statistical models to address business and client problems.
- API Integration and Deployment
- Write and test APIs to integrate AI models into business applications.
- Ensure smooth interaction between AI models and external systems.
- Support monitoring and optimisation of API performance for scalability and efficiency.
- LLM Implementation and Optimisation
- Fine-tune large language models with domain-specific data to improve relevance and accuracy.
- Deploy LLMs for NLP tasks, document summarisation, knowledge extraction, and workflow automation.
- Explore and apply LLM capabilities to enhance data querying, reporting, and decision-support tools.
- Reporting and Visualisation
- Build and maintain interactive dashboards (Tableau, Power BI, Python frameworks) to visualise insights for clients and internal teams.
- Create clear, data-driven reports and visualisations that communicate findings effectively to technical and non-technical audiences.
- Data Infrastructure and Workflow Automation
- Work with engineering teams to establish robust data pipelines and warehousing solutions.
- Contribute to workflow automation initiatives to reduce manual effort.
- Help shape io's long-term approach to structuring, storing, and reusing data and knowledge.
- Research and Innovation
- Research and prototype innovative solutions, keeping up with advances in AI, APIs, and data science.
- Experiment with new frameworks and libraries (e.g. LangChain, LlamaIndex, Hugging Face, PyTorch, TensorFlow).
- Contribute ideas for the commercialisation of io's proprietary analytics and engineering tools.
Key Skills and Qualifications
Essential
- Degree in Data Science, Engineering, Mathematics, Computer Science, or a related discipline.
- Strong coding ability in Python (R, MATLAB, SQL advantageous).
- Demonstrated experience coding AI/ML models (university projects, internships, or personal portfolio).
- Experience building dashboards using Tableau, Power BI, or Python libraries.
- Familiarity with data wrangling, workflow automation, and database management (SQL).
- Ability to communicate technical insights to both technical and non-technical stakeholders.
- Curiosity and enthusiasm for applying AI and data science to real-world challenges.
- Desirable
- Exposure to LLMs, NLP, or generative AI projects.
- Familiarity with cloud platforms (AWS, Azure, or GCP).
- Knowledge of APIs and integration methods (REST, GraphQL).
- Awareness of sustainability, energy, or infrastructure sectors.
- Understanding of containerisation (Docker, Kubernetes) and DevOps practices.
What We Offer
- Opportunities to code and deploy AI applications with real business impact.
- Hands-on exposure to cutting-edge AI and data frameworks.
- Mentorship from senior consultants and technical specialists.
- Professional development through training, certifications, and project work.
- Access to modern cloud and data science platforms.
- A collaborative environment where initiative and innovation are encouraged.
This role is ideal for individuals passionate about leveraging data science, AI, Code and LLMs to build impactful applications and drive meaningful change in a dynamic business environment.