Enable job alerts via email!

System Administrator (Slurm)

PRAGMATIKE

Ottawa

Remote

CAD 80,000 - 100,000

Full time

11 days ago

Job summary

A technology firm specializing in research infrastructure is seeking a System Administrator to manage and scale HPC clusters. This fully remote role requires expertise in Slurm and Linux systems. The ideal candidate will thrive in a fast-paced environment and support research teams with their compute-heavy workflows. Join a collaborative team committed to innovation and technology.

Qualifications

  • 3+ years of experience as a Systems Administrator or similar role.
  • Strong experience with Slurm workload manager and cluster administration.
  • Solid knowledge of Linux system internals, storage, and networking.
  • Proven experience supporting researchers or scientific computing teams.
  • Familiarity with configuration management tools and scripting.
  • Comfortable working independently in a remote environment.
  • Strong problem-solving and communication skills.

Responsibilities

  • Administer and maintain internal HPC clusters.
  • Collaborate with research teams for compute-heavy workflows.
  • Ensure high availability and performance of infrastructure systems.
  • Monitor, troubleshoot, and resolve system and network issues.
  • Manage user accounts, security, and access controls.
  • Automate tasks and improve system processes through scripting.
  • Maintain and document configurations and procedures.
  • Participate in infrastructure upgrades and scaling initiatives.
  • Provide expert support for Linux-based systems.

Skills

Linux system internals
Slurm workload manager
Cluster administration
Bash scripting
Python scripting
Problem-solving
Communication skills

Tools

AWS
GCP
Monitoring tools
Security and access management systems
Job description
Overview

We are hiring at Pragmatike to expand our team and support the growth of our internal projects. Our focus is on developing cutting-edge solutions in Cloud Computing, Blockchain, and Artificial Intelligence, while fostering a culture of collaboration and innovation. Joining us means being part of a passionate team where your ideas and skills directly contribute to shaping tomorrow's technologies.

We are currently looking for a System Administrator to help manage and scale our infrastructure. The role is ideal for someone who thrives in a fast-paced research and development environment, has experience managing HPC clusters, and is comfortable working closely with researchers. If you're passionate about systems, automation, and high-performance computing, we’d love to meet you.

Languages: English is mandatory

Type: Full-Time (40h / week)

Location: Fully remote, EST preferred (+ / -3h CET)

Start date: ASAP

Responsibilities
  • Administer and maintain internal HPC clusters using Slurm for workload management.
  • Collaborate with research teams to support compute-heavy workflows.
  • Ensure high availability and performance of infrastructure systems.
  • Monitor, troubleshoot, and resolve system and network issues.
  • Manage user accounts, system security, and access controls.
  • Automate routine tasks and improve system processes through scripting.
  • Maintain and document configurations, procedures, and system changes.
  • Participate in infrastructure upgrades and scaling initiatives.
  • Provide expert support for Linux-based systems in a hybrid cloud environment.
Qualifications
  • 3+ years of experience as a Systems Administrator or similar role.
  • Strong experience with Slurm workload manager and cluster administration. Slurm is a must.
  • Solid knowledge of Linux system internals, storage, and networking.
  • Proven experience supporting researchers or scientific computing teams.
  • Familiarity with configuration management tools and scripting (Bash, Python, etc.).
  • Comfortable working independently in a remote, asynchronous environment.
  • Strong problem-solving and communication skills.
Industry Knowledge

AI focus; enterprise-grade research infrastructure; high-performance computing environments.

Tools & Technologies

Linux, Slurm, HPC clusters, Bash / Python, cloud infrastructure (e.g., AWS / GCP), monitoring tools, security and access management systems.

Pragmatike is an Equal Opportunity Employer and is committed to providing equal employment opportunities to all applicants without discrimination. We recruit on behalf of our clients and prohibit discrimination and harassment based on race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state, or local laws. This policy applies to all terms and conditions of employment, including recruiting, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation, and training. We are committed to a fair and inclusive hiring process. We process your personal data solely for recruitment purposes, in accordance with applicable privacy laws, and maintain reasonable safeguards to protect your information. Your data may be shared with our client(s) for hiring consideration, but will not be disclosed to third parties outside of the recruitment process.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.