Job Summary
As a key member of the AI Services Division, the AI Infrastructure Project Manager will oversee the design and deployment of AI infrastructure projects.
This role is responsible for managing cross-functional teams and collaborating with external partners to ensure project delivery within defined cost, schedule, and quality parameters.
The ideal candidate will combine a deep technical understanding of AI computing platforms, particularly the NVIDIA GPU ecosystem, with strong project management expertise.
Key Responsibilities
1. Pre-Sales Technical Support
- Lead the pre-sales solution design process for AI servers and related hardware systems.
- Oversee Proof of Concept (POC) testing, performance optimization, and solution validation to ensure alignment with customers’ business and technical objectives.
2. Deployment Project Management
Manage end-to-end AI infrastructure deployment projects, ensuring seamless execution through:
- Leading teams to perform design reviews and site surveys.
- Overseeing point-to-point cabling plan development and validation.
- Developing comprehensive project plans, including timelines, resource allocation, and risk mitigation strategies.
- Conducting project kick-off meetings and coordinating with stakeholders across multiple levels.
- Managing project budgets, schedules, risks, and issues to ensure successful delivery.
- Driving clear communication across all project tiers—internal teams, customers, and partners.
- Ensuring timely, high-quality project deliverables that meet agreed specifications.
3. Maintenance and Support
- Diagnose and resolve technical issues within AI infrastructure environments.
- Implement corrective actions through hardware replacement, software upgrades, or appropriate workarounds.
- Escalate issues to vendors or partners when advanced support is required.
Qualifications and Skills
- Bachelor’s degree or higher in Computer Science, Electronics, Communications, Information Engineering, or a related discipline.
- Minimum 5 years of experience in IT systems integration or infrastructure project management.
- Strong familiarity with server hardware architectures (CPU/GPU, memory, storage, and networking).
- In-depth understanding of computing platforms from vendors such as NVIDIA, Intel, and AMD.
- Hands-on experience managing or contributing to large-scale NVIDIA AI infrastructure deployments involving at least one of the B200, GB200, or GB300 platforms.
- Excellent leadership, problem-solving, and stakeholder management skills.
- Strong teamwork orientation, adaptability, and customer-focused mindset.
- Proficiency in English, with the ability to read technical documents and communicate professionally.
- A project management certification such as PMP, PRINCE2, or an equivalent is required.