Job Search and Career Advice Platform

Enable job alerts via email!

Linux/HPC Systems Engineer

Advanced Micro Devices

United Kingdom

Hybrid

GBP 60,000 - 80,000

Full time

Yesterday
Be an early applicant

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A leading technology company in the United Kingdom is seeking a skilled Linux/HPC Systems Engineer to manage high-performance computing clusters. The ideal candidate will have a strong background in DevOps, proficiency in Kubernetes, CI/CD tools, and scripting in Python or Bash. Responsibilities include deploying Slurm-managed HPC clusters, maintaining GPU compute nodes, and automating infrastructure. Candidates should hold at least a bachelor's degree in a relevant field and have solid industry experience. The company offers a hybrid work model and competitive benefits.

Benefits

Competitive salary
Hybrid work model
Comprehensive benefits package

Qualifications

  • Strong experience managing high-performance computing clusters.
  • Proficient in CI/CD tools like Buildkite and GitHub Actions.
  • Able to automate infrastructure with Ansible, Terraform, and scripting.

Responsibilities

  • Deploy and maintain Slurm-managed HPC clusters.
  • Manage GPU compute nodes and high-speed interconnects.
  • Automate infrastructure provisioning with Python and Bash.

Skills

Linux administration
Kubernetes
CI/CD tools
Scripting (Python/Bash)
Infrastructure automation (Ansible)

Education

Bachelor's or master's degree in computer/software engineering

Tools

Docker
Slurm
Terraform
Grafana
Prometheus
Job description
WHAT YOU DO AT AMD CHANGES EVERYTHING

At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you’ll discover the real differentiator is our culture. We push the limits of innovation to solve the world’s most important challenges—striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond.

Together, we advance your career.
THE ROLE:

We are seeking a highly skilled Linux/HPC Systems Engineer with experience in managing both high-performance computing clusters and modern DevOps infrastructure. The ideal candidate combines expertise in Slurm-managed HPC clusters, GPU compute environments, CI/CD pipelines, and Kubernetes-based orchestration. This person thrives in collaborative, fast-paced environments, drives technical execution with minimal oversight, and has a passion for building reliable, scalable, and high-performance systems.

THE PERSON:

The ideal candidate is a skilled engineer with a strong background in DevOps, site reliability, or infrastructure engineering. They are proficient in Kubernetes, CI/CD tools, scripting (Python/Bash), and infrastructure automation frameworks such as Ansible. Experience working with GPU compute environments and integrating automated test workflows is highly valued. This person thrives in collaborative, fast-paced environments and can drive technical execution with minimal oversight. They bring a problem-solving mindset, strong communication skills, and a passion for building reliable, scalable systems.

KEY RESPONSIBILITIES:
  • Deploy, configure, and maintain HPC clusters using Slurm.
  • Manage GPU compute nodes, high-speed interconnects, and parallel storage systems.
  • Design and maintain CI/CD pipelinesusing Buildkite, GitHub Actions, Jenkins.
  • Automate infrastructure provisioning and configuration with Ansible, Terraform, Python, Bash.
  • Deploy containerized applications using Docker, Kubernetes, Helm.
  • Monitor cluster health and performance; build dashboards with Grafana, Prometheus, Checkmk.
  • Collaborate across teams to optimize workflows, troubleshoot issues, and document best practices.
PREFERRED EXPERIENCE:
  • Strong experience with Slurm or equivalent HPC schedulers.
  • CI/CD, DevOps tools, and automation expertise.
  • GPU compute and lifecycle management (CUDA/ROCm).
  • Linux administration, shell scripting, and distributed systems troubleshooting.
  • Containerization and orchestration (Docker, Kubernetes, Helm).
  • Agile, collaborative mindset with strong communication skills.
ACADEMIC CREDENTIALS:
  • Bachelor's or master's degree in computer/software engineering, Computer Science, or related technical discipline
  • Solid years of industry experience

#LI-EV1

#LI-REMOTE

Benefits offered are described:

AMD benefits at a glance

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.