Essential Duties and Responsibilities
As an AI Technician, you will serve as the Directly Responsible Individual (DRI) for daily operations within the data center.
You will lead hands-on installation, maintenance, and troubleshooting of compute and network infrastructure critical to Groq’s high-performance AI workloads.
- Hardware Operations
- Receive, unpack, and move servers and other equipment to the data center floor.
- Install, cable, and maintain servers, network switches, and power distribution units (PDUs) in racks.
- Perform hardware-level bring-up and testing using Linux command-line tools.
- Ensure proper accountability for equipment and assets through inventory management.
- Troubleshooting & Support
- Troubleshoot and resolve complex technical issues related to rack and node failures.
- Run scripts to debug and repair rack cabling and other hardware problems.
- Create, update, and resolve tickets in Groq's ticketing system to document all work.
- Participate in an on-call rotation to provide 24 / 7 support for data center operations.
- Infrastructure & Collaboration
- Execute final test sign-offs for newly built racks.
- Collaborate with other engineering teams to design and implement data center upgrades and expansions.
- Develop and maintain technical documentation, including diagrams and procedures, to ensure operational consistency.
- Ensure compliance with data center standards, policies, and procedures.
- Ideal Candidates Have
- 1-2 years years of experience in data center operations or a related field
- Strong knowledge of data center infrastructure, including servers, storage systems, and network devices
- Experience with data center management software, such as DCIM or BMS
- Strong problem-solving and analytical skills
- Excellent communication and teamwork skills
- Ability to work in a fast-paced environment and prioritize tasks effectively
- Strong attention to detail and ability to maintain accurate records
- Experience with scripting languages, such as Python or Bash
- Familiarity with virtualization technologies, such as Kubernetes
- Advanced fiber optic cabling skills
- Intermediate Linux skills
- Intrinsic curiosity and drive to stay up-to-date with the latest technologies and trends in data center infrastructure and operations
- Familiarity with Macbooks, Slack and Google docs
- Bachelor's degree in Computer Science, Information Technology, or related field, or equivalent experience
- Ability to travel up to 20% of the time
- Candidates should have 1-2 years of experience in data center operations or a related field, with strong knowledge of data center infrastructure.
- They should also possess problem-solving skills, experience with scripting languages, and familiarity with virtualization technologies.