HPC Systems Administrator

WeAreTechWomen

Greater London

Hybrid

GBP 50,000 - 70,000

Full time

2 days ago

Job summary

A leading technology firm is looking for an HPC Systems Administrator, Associate Manager to lead the development of AI-driven compute infrastructures in London. This role involves designing and managing high-performance clusters, supporting AI model training workflows, and optimizing compute resources. The ideal candidate will have expertise in HPC environments, experience with GPU administration, and scripting skills in languages like Python and PowerShell. A competitive salary package is offered, with the requirement to travel to client sites across the UK as needed.

Benefits

Competitive salary package
Opportunities for professional development

Qualifications

  • Expertise in HPC environments, including GPU cluster administration.
  • Proficiency with AI model training workflows and ML frameworks.
  • Advanced analytical and troubleshooting skills.
  • Experience with automation and monitoring platforms.
  • Strong communication and collaboration skills.

Responsibilities

  • Design, deploy, and manage HPC infrastructures.
  • Support AI model training by maintaining compute resources.
  • Monitor and analyze performance metrics.
  • Develop and maintain automation scripts to streamline tasks.
  • Document architecture and processes for compliance and improvement.

Skills

HPC environment expertise
GPU cluster administration
AI/ML frameworks support
Performance tuning skills
Scripting languages proficiency
Strong communication skills

Education

Relevant certifications such as ITIL, NVIDIA DLI, Dell EMC

Tools

PowerShell
Python
Bash

Job description

HPC Systems Administrator, Assoc Manager

Salary: Competitive salary and package (Depending on level of experience)

Locations: UK, London (must be willing to travel to client sites throughout the UK on an ad hoc basis)

Accenture are partnering with scaled UK AI compute pioneers to lead the charge on next-generation infrastructure for sovereign AI. To support this endeavor, we’re building a high-performance compute operations team in London.

Our work will be sensitive and secure, running on the most up-to-date high-density compute stacks available.

Any offer of employment is subject to satisfactory BPSS and SC security clearance, which requires 5 years' continuous UK address history (typically including no periods of 30 consecutive days or more spent outside of the UK) at the point of application.

Key Responsibilities:
  • Design, deploy, and manage HPC infrastructures including GPU clusters and parallel computing environments.
  • Support AI model training platforms by maintaining compute resources, optimizing scheduling, and ensuring compatibility with AI frameworks and libraries.
  • Monitor and analyse performance metrics, and fine-tune systems to address bottlenecks or inefficiencies.
  • Develop and maintain automation scripts and tools (e.g., PowerShell, Python, Bash) to streamline operational tasks, monitoring, and reporting.
  • Document architecture, configurations, processes, and resolutions for compliance, knowledge transfer, and continuous improvement.
  • Participate in root cause analysis (RCA) and post-incident reviews for compute or HPC-related incidents, implementing preventive measures as needed.

Required Skills:
  • Expertise in an HPC environment, including GPU cluster administration (e.g., NVIDIA, AMD) and workload schedulers such as SLURM or PBS.
  • Proficiency with AI model training workflows and experience supporting popular AI/ML frameworks (e.g., TensorFlow, PyTorch, CUDA).
  • Solid understanding of networking, storage, and server platforms in both Windows and Linux environments.
  • Advanced analytical, troubleshooting, and performance tuning skills, with the ability to diagnose and resolve complex compute and HPC issues.
  • Experience with automation, monitoring platforms, and scripting languages (e.g., Python, PowerShell, Bash) to enhance operational efficiency.
  • Strong communication and collaboration skills, with a track record of working effectively across technical and non-technical teams.
  • Familiarity with compliance, data security, and best practices for compute and HPC environments.

Qualification:
  • Relevant certifications such as ITIL, NVIDIA DLI, or Dell EMC.

Locations:

London

Equal Employment Opportunity Statement:

All employment decisions shall be made without regard to age, race, creed, color, religion, sex, national origin, ancestry, disability status, veteran status, sexual orientation, gender identity or expression, genetic information, marital status, citizenship status or any other basis as protected by federal, state, or local law.

Job candidates will not be obligated to disclose sealed or expunged records of conviction or arrest as part of the hiring process.

Accenture is committed to providing veteran employment opportunities to our service men and women.

Please read Accenture’s Recruiting and Hiring Statement for more information on how we process your data during the Recruiting and Hiring process.

About Accenture:

We work with one shared purpose: to deliver on the promise of technology and human ingenuity. Every day, more than 775,000 of us help our stakeholders continuously reinvent. Together, we drive positive change and deliver value to our clients, partners, shareholders, communities, and each other.

We believe that delivering value requires innovation, and innovation thrives in an inclusive and diverse environment. We actively foster a workplace free from bias, where everyone feels a sense of belonging and is respected and empowered to do their best work.

At Accenture, we see well-being holistically, supporting our people’s physical, mental, and financial health. We also provide opportunities to keep skills relevant through certifications, learning, and diverse work experiences. We’re proud to be consistently recognized as one of the World’s Best Workplaces™.

Join Accenture to work at the heart of change. Visit us at www.accenture.com.
