Job Search and Career Advice Platform

Enable job alerts via email!

HPC Storage Engineer (System), NSCC

A*STAR RESEARCH ENTITIES

Singapore

On-site

SGD 80,000 - 100,000

Full time

Today
Be an early applicant

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A leading research organization in Singapore is looking for an experienced HPC Storage Engineer to manage and optimize the storage infrastructure in high-performance computing environments. The role involves ensuring system reliability, performance testing, and collaborating with cross-functional teams. Candidates should have a degree in a relevant field and solid Linux skills with experience in parallel file systems like Lustre or GPFS. This position offers the chance to work in an innovative environment focused on advanced technology solutions.

Qualifications

  • Strong Linux skills and comfort with command-line interface.
  • At least 2 years of experience managing parallel file systems.
  • Familiar with RDMA-based interconnect technologies.

Responsibilities

  • Administer and optimize storage infrastructure.
  • Ensure high availability and reliability of storage systems.
  • Provide technical support and resolve storage-related issues.

Skills

Linux skills
Scripting with Bash and/or Python
Problem-solving

Education

Degree in Computer Science, Engineering, IT or related fields

Tools

Lustre
GPFS
BeeGFS
Job description
Job Summary

The HPC Storage Engineer will be responsible for managing the storage infrastructure within HPC environments. This role involves monitoring storage performance and optimizing through tuning and troubleshooting.

Responsibilities
  • Storage administration and optimization
  1. Collaborate with Managed Services teams to administer and support HPC storage infrastructure.
  2. Ensure high availability and reliability of storage systems.
  3. Provide technical support and resolve storage-related issues.
  4. Implement best practices for monitoring, alerting, and reporting.
  5. Track utilization and allocation trends to support capacity planning.
  6. Conduct performance testing and analysis of storage systems.
  7. Work with cross-functional teams to enhance performance and scalability.
  8. Maintain comprehensive documentation of infrastructure and operational processes.
  • Data management— Optimize data placement strategies for performance and efficiency.
  • Designing and planning— Support future storage expansion and HPC system design.
Qualifications
  • Degree in a Computer Science, Engineering, IT or other relevant areas.
  • Strong Linux skills and comfort with command-line interface.
  • Solid understanding of Linux file systems, including local (e.g., ext4, XFS), shared (e.g., NFS), and parallel (e.g., Lustre, GPFS, BeeGFS) file systems.
  • At least 2 years of experience managing parallel file systems such as Lustre, GPFS, BeeGFS, or similar technologies.
  • Familiar with RDMA-based interconnect such as InfiniBand or RoCE.
  • Proficient in scripting with Bash and/or Python.
  • Strong problem-solving skills and ability to troubleshoot complex issues.
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.