Team Lead for Data Center Server Service Engineering
Key Responsibilities
- Team Leadership & Operational Oversight:
- Manage and schedule a rotating team of service engineers (day, night, and weekend shifts)
- Conduct onboarding, training, and upskilling programs for L1–L3 engineers
- Perform regular performance reviews, mentoring, and coaching
- Lead daily stand-ups and weekly operations sync meetings
- Server Hardware Operations:
- Supervise installation, commissioning, and decommissioning of rack, blade, and GPU servers
- Oversee preventive maintenance (PM) and break-fix procedures across all hardware platforms
- Lead critical hardware incident investigations and Root Cause Analyses (RCA)
- Firmware, BIOS & BMC Management:
- Coordinate batch firmware upgrades via tools such as SeaChest, iDRAC, Redfish, or other vendor APIs
- Maintain firmware compliance logs and version control documentation
- Interface with vendor support for return merchandise authorizations (RMAs) and escalations
- Compliance, Documentation & Process Control:
- Ensure SOP adherence for hardware lifecycle processes and ITIL-aligned incident/change management
- Maintain inventory accuracy in DCIM/CMDB platforms
- Support ISO 27001, TIA-942, and Uptime Institute Tier compliance
Required Qualifications
- Bachelor's degree in Electrical Engineering, Computer Science, or related field
- 3+ years of experience in server infrastructure operations, with 1+ year in a team lead role
- In-depth knowledge of server platforms and components
- Experience with hardware management tools and interfaces
- Understanding of data center (DC) safety practices and cable management
Preferred Qualifications
- Certification such as CompTIA Server+, DCIM+, or equivalent
- Experience with GPU servers and liquid-cooled infrastructure
- Working knowledge of ITIL, ISO 27001, or Uptime Institute Tier III/IV
- Familiarity with automation or monitoring tool integrations
Reporting Line
Reports to the Data Center Operations Manager or Infrastructure Services Director
Travel
Occasional travel to remote data center sites or vendor coordination visits may be required.