Mission
Installation, configuration, and maintenance of data center infrastructure, including servers, storage systems, and network devices.
AI Technician
Essential Duties and Responsibilities:
As an AI Technician, you will serve as the
Directly Responsible Individual (DRI) for daily operations within the data center. You will lead hands-on installation, maintenance, and troubleshooting of compute and network infrastructure critical to Groq’s high-performance AI workloads.
Your Responsibilities Will Include
- Hardware Operations:
- Receive, unpack, and move servers and other equipment to the data center floor.
- Install, cable, and maintain servers, network switches, and power distribution units (PDUs) in racks.
- Perform hardware-level bring-up and testing using Linux command-line tools.
- Ensure proper accountability for equipment and assets through inventory management.
- Troubleshooting & Support:
- Troubleshoot and resolve complex technical issues related to rack and node failures.
- Run scripts to debug and repair rack cabling and other hardware problems.
- Create, update, and resolve tickets in Groq's ticketing system to document all work.
- Participate in an on-call rotation to provide 24/7 support for data center operations.
- Infrastructure & Collaboration:
- Execute final test sign-offs for newly built racks.
- Collaborate with other engineering teams to design and implement data center upgrades and expansions.
- Develop and maintain technical documentation, including diagrams and procedures, to ensure operational consistency.
- Ensure compliance with data center standards, policies, and procedures.
Ideal Candidates Have
- 1-2 years years of experience in data center operations or a related field
- Strong knowledge of data center infrastructure, including servers, storage systems, and network devices
- Experience with data center management software, such as DCIM or BMS
- Strong problem-solving and analytical skills
- Excellent communication and teamwork skills
- Ability to work in a fast-paced environment and prioritize tasks effectively
- Strong attention to detail and ability to maintain accurate records
- Experience with scripting languages, such as Python or Bash
- Familiarity with virtualization technologies, such as Kubernetes
- Advanced fiber optic cabling skills
- Intermediate Linux skills
- Intrinsic curiosity and drive to stay up-to-date with the latest technologies and trends in data center infrastructure and operations
- Familiarity with Macbooks, Slack and Google docs
- Bachelor's degree in Computer Science, Information Technology, or related field, or equivalent experience
- Ability to travel up to 20% of the time