Enable job alerts via email!

Computer System Engineer (Windows, Linux)

KLA-Tencor (Singapore) Pte Ltd

Singapore

On-site

SGD 70,000 - 90,000

Full time

Today
Be an early applicant

Job summary

A leading technology firm in Singapore is seeking a Computer Systems Engineer for their HPC Team. You will design and engineer HPC clusters, support product transitions, and drive innovations in computational technologies. The ideal candidate has strong Linux knowledge, experience with HPC hardware, and a DevOps focus. The role requires excellent problem-solving abilities and teamwork.

Qualifications

  • In-depth knowledge of Linux systems like Suse, RedHat, and Ubuntu.
  • Experience with HPC hardware and robust storage management.
  • Ability to develop Shell and Python scripts.
  • Familiarity with continuous integration and deployment pipelines.

Responsibilities

  • Support high-performance compute clusters focusing on CPU/GPU architecture.
  • Generate hardware BOMs and manage vendor activities.
  • Utilize Linux OS for system configuration and requirements specification.
  • Drive projects to ensure on-time achievement of goals.
  • Support product design and release with documentation.

Skills

Linux systems knowledge
HPC hardware knowledge
Shell and Python scripting
TCP/IP fundamentals

Education

BS or MS degree in Computer or Electrical Engineering

Tools

Jenkins
Docker
Kubernetes
Git
Job description
Overview

Role Summary:

LS-SWIFT HPC Team is charted to provide pioneering High Performance Computing solutions in enabling-Image processing algorithms for Reticle/Photomask inspections in real time. As a computer system engineer, you would Design and Engineer an embedded HPC Cluster which is a critical sub-system in KLA inspection tool. Your primary responsibilities include leading and driving efforts to prototype groundbreaking computational technologies, optimize cost, computation time and collaborate with peers to transition prototypes to production. Additional job responsibilities include qualification of the technology stack, diagnostics development, enhance monitoring and observability and support critical issues for new and legacy products.

Responsibilities
  • Support of high-performance compute clusters. Working knowledge on HPC systems, including CPU/GPU architecture, scalable/robust storage, high-bandwidth inter-connects, and a knowledge of cloud-based computing architectures.

  • Generate HW BOMs for the HPC Clusters, provide vendor management and oversee HW release activities.

  • Utilize Linux OS to configure appropriate operating systems for the HPC system. Understand and assemble the project specifications and performance requirements at the subsystem and system levels.

  • Adhere and drive to project timelines to ensure program achievements complete on time.

  • Support design and release of new products to manufacturing and ultimately the customer, providing quality golden images, procedures, scripts and documentation to the manufacturing team and customer support team.

Attitude
  • Looking for individuals who are inquisitive, thrives on challenge, enjoy problem solving and have excellent written & verbal skills.

Required Qualifications
  • Validated in-depth and flavor agnostic knowledge of Linux systems (Suse, RedHat, Rocky, Ubuntu).

  • Experience with maintaining and interacting with robust storage. Working HPC HW knowledge especially in the server, GPU, networking, Storage, BIOS & BMC arenas.

  • Experience in System-D, Netboot/PXE, Linux HA.

  • Strong understanding of TCP/IP fundamentals and knowledge of protocols, DNS, DHCP, HTTP, LDAP, SMTP.

  • Ability to code and develop Shell and Python scripts.

  • Experience with one or more of the listed Configuration Mgmt utilities. (Salt, Chef, Puppet etc).

Preferred Qualifications
  • Possess a strong DevOps focus: Knowledge of setting up a continuous development pipeline (Jenkins), Repository software (Git-based), Singularity & Docker Containers. Kubernetes, Prometheus & Grafana experience. Knowledge of Apache/Nginx, setting up proxy/reverse proxy, application server routing, load balancing (HAProxy).

  • BS or MS degree + 3 to 5 years validated experience in Computer Engineering or Electrical Engineering related fields.

Skills and Abilities
  • Team Orientation & Interpersonal – Highly motivated teammate with ability to develop and maintain collaborative relationships with all levels within and external to the organization.

  • Organization & Time Management – Able to plan, schedule, organize, and follow up on tasks related to the job to achieve goals within or ahead of established time frames.

  • Multi-task - Ability to expeditiously organize, coordinate, manage, prioritize, and perform multiple tasks simultaneously to swiftly assess a situation, determine a logical course of action, and apply the appropriate response.

  • Adaptability to Change – Able to be flexible and supportive, and able to assimilate change positively and proactively in rapid growth environment. Outstanding teammate with excellent written and verbal communications skills.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.