Enable job alerts via email!

Systems Engineer (AI/Linux) — Server Deployment

Taknet Systems Pte. Ltd.

Singapore

On-site

SGD 40,000 - 60,000

Full time

Today
Be an early applicant

Job summary

A technology solutions provider in Singapore is seeking an IT Support Associate to assist the IT Support Manager. The role involves deploying high-performance AI servers, maintaining documentation, and ensuring customer satisfaction through effective communication. Candidates should have 1-3 years of experience in Linux and data center operations, along with a strong command of Linux CLI and networking basics.

Qualifications

  • 1–3 years in Linux/servers or data center operations.
  • Comfortable with Linux CLI and basic networking.
  • Familiar with NVIDIA stack and firmware updates.
  • Physically able for data center work.
  • Good documentation habits.

Responsibilities

  • Stage and prepare servers for deployment.
  • Install and verify operating systems and drivers.
  • Apply vendor-approved BIOS and firmware updates.
  • Run imaging flows and follow checklists.
  • Execute burn-in tests and record results.
  • Assist with onsite deployment and customer sign-off.
  • Produce documentation and update templates.

Skills

Linux CLI
Basic networking (VLAN/IP)
SSH
Checklists and scripts (bash/Ansible basics)
Good communication

Education

Polytechnic diploma or degree in relevant field

Tools

NVIDIA stack (drivers, CUDA)
Supermicro platforms
Grafana/Prometheus
Job description
Overview

We deploy high‑performance AI servers for enterprise and public‑sector clients. You’ll assist the IT Support Manager to stage, image, validate and install single‑node and small‑pod systems. The role is hands‑on in the lab and onsite at data centres, with a strong focus on consistency, documentation and speed to customer acceptance.

Location: Singapore; Employment: Full‑time; Reports to: IT Support Manager

Responsibilities
  • Staging & Rack‑ready Prep: Rack, cable, label; capture asset/serial info and maintain inventory sheets.

  • OS & Driver Stack: Install Ubuntu/Rocky Linux, NVIDIA drivers, CUDA/cuDNN; verify with nvidia-smi.

  • Firmware & BIOS: Apply vendor‑approved BIOS/BMC/NIC firmware baselines; keep a simple firmware bill per build.

  • Imaging & Automation: Run PXE/imaging flows and Ansible/bash scripts; follow checklists to prevent drift.

  • Burn‑in & Validation: Execute 24–48h burn‑in; run sanity tests (NCCL ring‑allreduce, fio/iperf, stress/thermals) and record results.

  • Onsite Deployment: Assist with rack‑in, power/network checks, SAT/UAT, and collect same‑day customer sign‑off.

  • Documentation: Produce acceptance packs (serials, firmware bill, test logs, photos); update runbooks and templates.

  • Spares & RMA: Prep spares kits, coordinate RMAs and returns with vendors and logistics.

  • HSSE & DC etiquette: Follow ESD, lifting and DC safety rules; maintain tidy work areas and cabling standards.

Qualifications
  • 1–3 years in Linux/servers or DC operations; poly/degree or equivalent experience.

  • Comfortable with Linux CLI, systemd, basic networking (VLAN/IP), and SSH.

  • Familiar with NVIDIA stack (drivers, CUDA, basic NCCL checks) and firmware updates (BMC/BIOS/NIC).

  • Able to follow and improve checklists and scripts (bash/Ansible basics).

  • Physically able for DC work (rack equipment, cabling); clean documentation habits.

  • Good communication; customer‑facing onsite when required.

Nice‑to‑Haves
  • Experience with Supermicro platforms and Mellanox/ConnectX NICs.

  • PXE/kickstart/cloud‑init experience; Grafana/Prometheus for quick burn‑in dashboards.

  • Basic Windows Server installs for mixed environments.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.