Head of System & Operation (GPU System)

EPS Consultants

Kuala Lumpur

On-site

MYR 120,000 - 180,000

Full time

20 days ago

Job summary

A leading company is seeking an IT Operations Manager to oversee the design and maintenance of GPU resources. The ideal candidate will have over 10 years of experience in technology leadership, with a strong understanding of cloud computing and GPU technologies. This role involves strategic planning, team leadership, and vendor management to enhance operational efficiency and service quality.

Qualifications

  • 10+ years in IT operations or technology leadership.
  • Strong understanding of GPU technologies and cloud computing.
  • Experience managing complex IT systems.

Responsibilities

  • Oversee design and maintenance of IT systems for GPU resources.
  • Lead and mentor a diverse team of technology professionals.
  • Manage relationships with vendors to ensure compliance.

Skills

Analytical skills
Troubleshooting skills
Interpersonal skills
Strategic thinking
Communication skills

Education

Bachelor’s degree in Computer Science

Tools

Kubernetes
GPU technologies
CPU/GPU clusters

Job description

Job Responsibilities
  1. Oversee the design, implementation, and maintenance of IT systems supporting operational activities, ensuring high availability and performance of GPU resources.
  2. Provide technical guidance across complex infrastructure projects.
  3. Develop and execute operational strategies aligned with the company’s goals for GPU-as-a-Service, focusing on scalability, efficiency, and reliability.
  4. Lead and mentor a diverse team of technology professionals, fostering a culture of innovation, accountability, and continuous improvement.
  5. Manage relationships with key vendors and third-party service providers to ensure compliance with SLAs and industry standards.
  6. Identify process improvement opportunities across operations and implement best practices to enhance productivity, reduce costs, and improve service quality.
  7. Collaborate with product development, sales, and marketing teams to ensure seamless service integration and alignment with customer needs.
  8. Ensure compliance with relevant laws, regulations, and industry standards related to data protection and service delivery.

Minimum Requirements
  • Malaysian nationality.
  • Bachelor’s degree in Computer Science or a related technical field.
  • 10+ years of proven experience in operations or technology leadership within the IT or cloud services industry.
  • Strong understanding of GPU technologies and cloud computing principles.
  • Experience managing complex IT systems and operational processes.
  • Exceptional analytical and troubleshooting skills.
  • Knowledge of Kubernetes environments and the ability to debug them.
  • Familiarity with energy-efficient computing and sustainable data center operations.
  • Ability to manage priorities in a dynamic, fast-paced environment.
  • Hands-on expertise with CPU/GPU clusters and platforms.
  • Excellent communication skills for technical and non-technical audiences.
  • Strong interpersonal skills for developing professional relationships across teams.
  • Proven ability to manage multiple projects with attention to detail.
  • Knowledge of the processes for operating and managing CPU/GPU clusters.
  • Strategic thinking and ability to implement innovative solutions.
  • Excellent documentation skills for technical designs, issues, and procedures.