What You’ll Do:
- Build Agentic Systems – Develop and optimise LLM-based agents that can reason, plan, and execute multi-step tasks autonomously.
- Enhance LLM Reasoning – Apply reinforcement learning, tool use, and reflection techniques to strengthen decision‑making and contextual understanding.
- Design Scalable Frameworks – Create data pipelines, annotation tools, and evaluation flywheels that accelerate model iteration and feedback loops.
- Collaborate with Research Teams – Translate experimental findings into production‑grade systems that extend the autonomy and reliability of agents.
- Run Experiments End‑to‑End – Own your compute environment (e.g. Jupyter, Colab, Databricks) and iterate on large‑scale LLM training and evaluation.
What You’ll Bring:
- 4+ years’ experience in Machine Learning or AI, with exposure to LLM agent systems, tool‑use frameworks, or generative AI.
- Advanced proficiency in Python and ML frameworks such as PyTorch or TensorFlow.
- Hands‑on experience developing or fine‑tuning LLaMA, GPT, or similar foundation models.
- Understanding of agentic architectures, chain‑of‑thought reasoning, and memory/reflection mechanisms.
- Proven ability to debug, optimise, and scale ML experiments and compute pipelines.
- MSc or PhD in AI, Computer Science, or related field.
Bonus Skills:
- Research track record (papers, open‑source, Kaggle).
- Experience building back‑end APIs or front‑end interfaces for model annotation or experimentation.
Contract Details:
- Competitive Rates (Outside IR35)
- 6–12 months (with likely extension)
- Remote – UK based
Please apply for immediate consideration.
Company: Experis UK