Machine Learning Ops Engineer

SSC Recruitment Solutions Ltd

Oxford

On-site

GBP 50,000 - 70,000

Full time

21 days ago

Job summary

A leading research and development firm in the UK is seeking an ML Ops Engineer to join its team. The role involves provisioning and maintaining a modern ML Operations stack, managing an on-premise Kubernetes cluster, and implementing ML Ops solutions. Candidates should have strong programming skills in Python and experience with AWS services. The position offers the opportunity to guide technical direction and lead projects.

Qualifications

  • Relevant academic and/or industry experience.
  • Excellent knowledge of managing an on-premise Kubernetes cluster.
  • Good programming ability in Python and familiarity with Linux systems.

Responsibilities

  • Join the ML Operations team to support ML development.
  • Provision and maintain a modern ML Operations stack.
  • Guide technical direction and lead projects.

Skills

Kubernetes management
Kubeflow
Python programming
AWS services
ML toolkits

Education

Research Masters level

Job description

Overview

We are looking for an excellent ML Ops Engineer to join our research and development team.

Responsibilities

  • You will join the ML Operations team, which supports the ML Development team in building leading-edge motion capture products by provisioning and maintaining a modern ML Operations stack.
  • This stack covers data acquisition pipelines, data management, and ML model training infrastructure (software and on-prem hardware). We use both on-prem, self-managed systems and AWS infrastructure.
  • You will have opportunities to guide the technical direction of the ML Ops team, suggest new areas of development, and potentially lead your own project.

Required Skills, Knowledge and Expertise

  • Relevant academic (research Masters level) and/or industry experience.
  • Excellent knowledge and experience of managing an on-premise Kubernetes cluster.
  • Excellent knowledge of Kubeflow and similar systems, e.g. MLflow.
  • Good programming ability in Python and familiarity with Linux systems, including scripting and system configuration.
  • Experience using AWS services, e.g. Cognito, S3, EC2, and Lambda.
  • Experience with ML toolkits, e.g. PyTorch and Lightning, along with a solid understanding of how these fit into ML Ops pipelines and tools.
  • Ability to design and implement ML Ops solutions spanning many different technologies.

Desirable Skills

  • Background in DevOps with exposure to CI systems, e.g. Jenkins.
  • Familiarity with infrastructure as code, e.g. Ansible.
  • Experience with, aptitude for, and a desire to work with human motion, sport, and animation tools and techniques.
  • Familiarity with C.