Job Search and Career Advice Platform

Enable job alerts via email!

Machine Learning Operations Engineer

Ssc Recruitment Solutions Ltd

Greater London

On-site

GBP 60,000 - 80,000

Full time

Yesterday
Be an early applicant

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A recruitment agency is looking for an experienced ML Ops Engineer to enhance their research and development team based in Greater London. The role involves managing an on-premise Kubernetes cluster, utilizing AWS services, and designing MLOps solutions. You will be working with cutting-edge tools and have the potential to lead your own projects. This opportunity requires a strong background in relevant technologies and a passion for innovation in machine learning applications.

Qualifications

  • Relevant academic or industry experience at a Master’s level.
  • Excellent knowledge and experience managing Kubernetes clusters.
  • Experience with ML toolkits and designing MLOps solutions.

Responsibilities

  • Join the ML Operations teams supporting ML Development.
  • Guide the technical direction of the ML Ops team.
  • Lead your own project within the ML Ops framework.

Skills

Kubernetes management
Kubeflow
Python programming
AWS services
ML toolkits (e.g., PyTorch)
DevOps knowledge
Ansible
C++ familiarity

Education

Master’s degree or relevant industry experience
Job description
Key Responsibilities

We are looking for an excellent ML Ops Engineer to join our research and development team.

This opportunity is to join the ML Operations teams which supports the ML Development team in building leading-edge motion capture products through provisioning and maintaining a modern ML Operations stack.

This stack covers data acquisition pipelines, data management and ML model training infrastructure (SW and on-prem HW). We use both on‑prem, self‑managed systems and also leverage AWS infrastructure.

You will have opportunities to guide the technical direction of the ML Ops team, suggest new areas of development and the potential to lead your own project.

Required Skills, Knowledge and Expertise

You will have relevant academic (research Masters level) and / or industry experience.

Essential Skills
  • Excellent knowledge and experience of managing an on-premise Kubernetes cluster.
  • Excellent knowledge of Kubeflow and similar systems, e.g. MLflow
  • Good programming ability in Python with familiarity with Linux systems including scripting and system configuration.
Experience using AWS, e.g, Cognito, S3, EC2, Lamdas, etc.

Experience with ML toolkits, e.g. PyTorch, Lightning, etc., along with a solid understanding of how these fit into ML Ops pipelines and tools.

Be able to design and implement MLOps solutions covering many different technologies.

Desirable Skills
  • Background in DevOps with exposure to CI systems, e.g. Jenkins
  • Familiarity with infrastructure as code, e.g. Ansible
  • Experience, aptitude, and a desire to work with human motion capture, sport, animation tools and techniques.
  • Familiarity with C++.
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.