Job Search and Career Advice Platform

Enable job alerts via email!

Infrastructure & MLOps Engineer

Graphcore

Bristol

On-site

GBP 80,000 - 100,000

Full time

Today
Be an early applicant

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A cutting-edge AI technology company is looking for a skilled Software Infrastructure Engineer to join their dynamic team in Bristol. The role involves developing essential tools for enhancing the AI software development lifecycle, managing Cloud Infrastructure with Terraform and deploying services using Kubernetes and Docker. Candidates should have knowledge of programming languages like Python, experience in Linux environments, and familiarity with cloud services. The company offers a competitive salary and numerous benefits to ensure a great work environment.

Benefits

Flexible working
Generous annual leave policy
Private medical insurance
Health cash plan
Dental plan
Pension (matched up to 5%)
Life assurance
Income protection
Healthy food and snacks
Employee assistance programme

Responsibilities

  • Develop, own, and maintain tools and services to support AI research and engineering teams.
  • Deploy and maintain services with Kubernetes and Docker.
  • Manage our Cloud Infrastructure using tools such as Terraform.

Skills

Python
Cloud services (e.g. AWS)
Linux environments
CI/CD principles
Machine learning applications
ML orchestration tools (e.g. NV Ray, KFP, SkyPilot)
ML accelerator hardware (e.g. DCGM)
Infrastructure as Code (IaC) tools (e.g. Terraform/OpenTofu)
GitHub Actions
Modern observability tooling (e.g. Prometheus)
Grafana
Go/Java/C++ (or similar language)
Job description

Graphcoreis one of the world’s leading innovators in Artificial Intelligence compute.

It is developing hardware, software and systems infrastructure that will unlock the next generation of AI breakthroughs and power the widespread adoption of AI solutions across every industry.

As part of the SoftBank Group,Graphcoreis a member of an elite family of companies responsible for some of the world’s most transformative technologies. Together, they share a bold vision: to enable Artificial Super Intelligence and ensure its benefits are accessible to everyone.

Graphcore’s teams are drawn from diverse backgrounds and bring a broad range of skills and perspectives.A melting pot of AI research specialists, silicon designers, software engineers and systems architects, Graphcore enjoys a culture of continuous learning and constant innovation.

Job Summary

Join our dynamic Software Infrastructure team and take a pivotal role in scaling and managing our infrastructure. You will develop essential tools and services that empower our broader software team. Your contributions will enhance the build, test, deployment, and productisation processes of our Machine Learning Software components. Work with our High-Performance Computing (HPC) AI platforms and gain invaluable experience in distributed systems

The Team

The Software Infrastructure team provides critical platforms and services for software development teams across the business. Our responsibilities include managing the CI platform and services, build engineering, component integration, and packaging and release systems. We operate in squads, fostering a culture of service ownership and empowerment for our engineers. We focus on long‑term engineering solutions and strive to eliminate toil wherever possible.

Responsibilities and Duties
  • Develop, own, and maintain tools and services to support AI research and engineering teams
  • Deploy and maintain services with Kubernetes and Docker
  • Manage our Cloud Infrastructure using tools such as Terraform
Candidate Profile
  • Knowledge of Python
  • Familiarity with cloud services (e.g. AWS)
  • Experience managing or developing in Linux environments
  • Understanding of CI/CD principles
  • Experience maintaining machine learning applications
  • Experience deploying ML orchestration tools (e.g. NV Ray, KFP, SkyPilot)
  • Experience managing ML accelerator hardware (e.g. DCGM)
  • Experience with Infrastructure as Code (IaC) tools (e.g. Terraform/OpenTofu)
  • Experience with GitHub Actions
  • Experience with modern observability tooling (e.g. Prometheus)
  • Experience with Grafana
  • Knowledge of Go/Java/C++ (or similar language)
Benefits

In addition to a competitive salary, Graphcore offers flexible working, a generous annual leave policy, private medical insurance and a health cash plan, a dental plan, pension (matched up to 5%), life assurance and income protection. We have a generous parental leave policy and an employee assistance programme (which includes health, mental wellbeing, and bereavement support). We offer a range of healthy food and snacks at our central Bristol office and have our own barista bar! We welcome people of different backgrounds and experiences; we’re committed to building an inclusive work environment that makes Graphcore a great home for everyone. We offer an equal opportunity process and understand that there are visible and invisible differences in all of us. We can provide a flexible approach to interview and encourage you to chat to us if you require any reasonable adjustments.

Applicants for this position must hold the right to work in the UK. Unfortunately at this time, we are unable to provide visa sponsorship or support for visa applications.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.