About the Company
Shanghai Shuoyao Technology is a global leader in AI computing infrastructure, providing efficient, stable, and secure computing power for AI training and inference. With deep expertise in deploying and managing large-scale, high-performance computing clusters worldwide, we ensure 24/7 operational excellence through our intelligent O&M platform and international support network. As we expand our footprint in Malaysia, we are building a diverse and collaborative team where every member's contribution is valued.
About the Role
We are looking for a meticulous and proactive Data Center IT Operations Engineer to join our growing team. You will be at the frontline of ensuring the physical health and operational readiness of our AI computing servers. This role is central to maintaining our high availability standards, involving hands‑on hardware maintenance, system provisioning, and rapid response to incidents. It is a fantastic opportunity to build deep technical expertise in a cutting‑edge, high‑density computing environment.
Key Responsibilities
- Perform daily inspections and monitoring of air‑cooled and liquid‑cooled server racks, documenting hardware status, temperatures, and performance metrics.
- Execute full lifecycle physical server management: from initial racking, cabling, and deployment (OS installation via PXE/automation tools) to decommissioning, removal, and hardware refreshes.
- Independently diagnose, troubleshoot, and replace faulty hardware components (e.g., disks, memory, PSUs, NICs, GPU cards) to minimize system downtime.
- Perform basic network troubleshooting (link status, IP configuration) and collaborate closely with network and system administration teams for complex issues.
- Liaise effectively with hardware vendors and technical support to escalate and resolve complex hardware failures, ensuring swift restoration of service.
- Maintain accurate documentation of all activities, update standard operating procedures (SOPs), and contribute to a shared knowledge base of fault cases and solutions.
- Continuously identify and suggest improvements to workflows, tooling, and processes to enhance data center operational efficiency and safety.
Qualifications & Experience
- Diploma or Bachelor’s degree in Computer Engineering, Information Technology, or a related technical field.
- 2-3 years of hands‑on experience in a data center environment, specifically in server hardware maintenance and physical layer operations.
- Experience in a cloud service provider data center, HPC environment, or the IT infrastructure team of a technology company is highly preferred.
- Familiarity with the complete server lifecycle within a data center context is a strong advantage.
Required Skills & Competencies
- Technical Proficiency: Practical experience installing, configuring, and troubleshooting CentOS, Ubuntu, or other major Linux distributions. Skilled in RAID configuration and management.
- Hardware Knowledge: Solid understanding of server architecture (x86), and components. Ability to safely handle and replace hardware following strict ESD and safety protocols.
- Process‑Oriented: Meticulous attention to detail for logging, labeling (asset, network ports), and following documented procedures for all rack‑and‑stack activities.
- Problem‑Solving: A logical and systematic approach to diagnosing hardware issues under pressure.
- Team Player: Excellent communication skills (verbal and written in English) to collaborate with local and international teams. Responsible, reliable, and adaptable to shift work or on‑call schedules as needed.
We Offer
- Hands‑On Expertise: Deepen your skills with some of the world’s most advanced AI computing hardware in a real‑world production environment.
- Growth Pathway: Clear opportunities for technical advancement into senior engineering or specialization roles (e.g., in liquid cooling, DCIM automation).
- Global & Inclusive Culture: Become part of a supportive, diverse, and professionally driven team where knowledge sharing is encouraged.
- Competitive Package: A market‑competitive salary and benefits package commensurate with your experience.
Equal Opportunity Employer
We are proud to be an equal‑opportunity employer. We hire, develop, and retain talent based on merit and business needs, and are committed to creating a workplace where everyone feels they belong.