
A tech company in Penang is seeking a Test Software Development Manager to lead the design, development, and maintenance of diagnostic software and validation frameworks for AI servers. Candidates should have over 7 years of experience in software development, strong debugging skills, and technical leadership abilities. The role involves defining test strategies, overseeing testing processes, and ensuring team competency through structured training. This position offers the chance to work on cutting-edge technologies in a dynamic environment.
The Test Software Development Manager leads the team responsible for designing, developing, and maintaining the diagnostic software, DiagOS, and the automated validation framework used to qualify AI servers and racks. This role owns the full test software stack, from low‑level hardware access and NVIDIA SDK‑based diagnostics to large‑scale automated system validation in the lab.
The manager will define the test software architecture, translate product and hardware requirements into test strategies, and coach engineers through a structured training plan covering hardware architecture, firmware & diagnostic software development, and test/automation practices.
1. Technical Ownership & Strategy
2. Diagnostic Software & DiagOS Leadership
4. Validation Scope & Quality
6. Cross‑Functional & Partner Collaboration
7. Process, Compliance & Documentation
Bachelor’s or Master’s degree in Computer Science, Software Engineering, Electrical/Computer Engineering, or related field.
7+ years in software development, test automation, or validation engineering, with at least 2–3 years in a lead or managerial role.
Hands‑on experience with Linux, Python (or similar scripting), and CI/CD pipelines for hardware validation.
Exposure to GPU/AI server platforms, diagnostic tools, or data‑center/server validation is strongly preferred.
Strong technical depth in at least two of:
Familiarity with some of the toolchain and concepts in the training plan, e.g. CUDA/NVML/DCGM, i2c‑tools, pciutils, fio, iperf3, stress‑ng, memtest86+, Redfish/IPMI, PXE provisioning, BMC/BIOS basics.
Proven ability to design test strategies and frameworks, not just write individual test cases.
Strong debugging and root‑cause analysis mindset.
Strong people‑leadership skills: coaching, delegation, feedback, and performance management.
Clear written and verbal communication for collaboration with cross‑functional teams and external partners.