Job Summary
As a key role within the AI service division, this position is responsible for supporting the design, deployment, and maintenance of AI infrastructure. Candidates must possess rich practical experience, with a strong focus on the Nvidia GPU ecosystem.
Job Responsibilities
- Pre-sale Tech. Support
- Responsible for pre-sales technical support for AI servers and related hardware products, including demand research, solution design, performance evaluation, and technical communication.
- Gain in-depth understanding of customers' AI application scenarios (such as large model training, inference, HPC computing, data analysis, etc.) and provide optimal server configuration and system architecture solutions.
- Assist the sales team in preparing pre-sales technical documents, bidding documents, solution PPTs, and conduct technical demonstrations and presentations.
- Participate in POC testing, performance optimization, and technical verification to ensure solutions meet customers' business objectives.
- Deployment Support
- Install AI infrastructure at designated locations with or without partners, including Physical Deployment, Logical Configuration, and Operation Enabling.
- Physical Deployment: Physical onsite network and computer installation; Cabling and labelling.
- Logical Configuration: Network and computer logical configuration; System testing and benchmarking.
- Operation Enabling: Deliver configuration and operation documents; Setup fundamental operation tools, e.g., monitoring.
- Maintenance Support
- Diagnose the detected issues of AI infrastructure and identify the root cause.
- Fix the failures/errors by H/W replacement, software upgrade or other workarounds.
- Escalate to vendor if requires extra support.
Qualifications and Skills
- Bachelor's degree or above in Computer Science, Electronics, Communications, Information Engineering, or related fields.
- More than 5 years of work experience in server OEM or related fields (such as AI computing, big data platforms, HPC).
- Familiar with mainstream server hardware architectures (CPU/GPU, memory, storage, network), and have in-depth understanding of computing platforms such as NVIDIA, Intel.
- Experience in deploying and/or maintaining large-scale Nvidia AI infrastructure, including one of these three products (B200, GB200 or GB300).
- Have good team spirit, learning ability, and customer service awareness.
- Good English reading and writing skills, able to read English technical documents and conduct basic oral communication.