Ativa os alertas de emprego por e-mail!

Linux Infrastructure Senior Specialist

Equinix

São Paulo

Presencial

BRL 120.000 - 160.000

Tempo integral

Há 28 dias

Resumo da oferta

A leading company in digital infrastructure is seeking a Linux Infrastructure Senior Specialist in São Paulo. The role involves managing critical Linux environments, focusing on AI and Machine Learning projects. Candidates should have substantial experience in Linux administration, AI workloads, and advanced problem-solving skills. This position offers a chance to work in a dynamic, diverse environment with a commitment to growth and sustainability.

Qualificações

  • Experience in Linux system administration in mission-critical environments.
  • Advanced English proficiency required.
  • Familiarity with AI workload environments and troubleshooting needed.

Responsabilidades

  • Administer Linux environments ensuring high availability and performance.
  • Support infrastructure for AI workloads, including GPU configurations.
  • Conduct root cause analysis of incidents and standardize environments.

Conhecimentos

Linux system administration
AI workload support
Advanced troubleshooting
Performance analysis
Technical documentation
Collaboration

Ferramentas

Kubernetes
Docker
NGFW appliances

Descrição da oferta de emprego

Who are we?

Equinix is the world’s digital infrastructure company, operating over 260 data centers across the globe. Digital leaders harness Equinix's trusted platform to bring together and interconnect foundational infrastructure at software speed. Equinix enables organizations to access all the right places, partners, and possibilities to scale with agility, speed the launch of digital services, deliver world-class experiences, and multiply their value, while supporting their sustainability goals.

Our culture is based on collaboration and the growth and development of our teams. We hire hardworking people who thrive on solving challenging problems and give them opportunities to hone new skills and try new approaches, as we grow our product portfolio with new software and network architecture solutions. We embrace diversity in thought and contribution and are committed to providing an equitable work environment that is foundational to our core values as a company and is vital to our success.

Job Summary

We are looking for a Linux Infrastructure Senior Specialist to work in critical and high-performance environments, focusing on infrastructure support for critical environments and projects, including Artificial Intelligence.

Responsibilities

  1. Administer Linux environments (RHEL, CentOS, Ubuntu, and Suse), ensuring high availability and performance.
  2. Support infrastructure that serves AI and Machine Learning workloads, including configuring environments with GPU support (NVIDIA, CUDA).
  3. Perform advanced troubleshooting, performance analysis, and provide second and third level technical support.
  4. Collaborate with global teams, participate in meetings, and deliver technical solutions in English.
  5. Conduct root cause analysis of incidents.
  6. Produce documentation related to the environment (RCA, KBs, etc.).
  7. Standardize environments by acting on incidents and requests relating to their specialty.
  8. Plan and implement changes, migrations, and improvements based on a change plan.
  9. Collect information on managed environments to meet internal demands.
  10. Maintain and update documentation related to the client's production environment.
  11. Operate, maintain, and document customer equipment with a management product (Managed Services) following policy.
  12. Identify opportunities for improvement and new business opportunities.
  13. Analyze the production environment to propose improvements.
  14. Provide telephone support for incidents, requests, and change preparations.
  15. Support first-level service teams with technical guidance or escalate demands.
  16. Participate in planning and implementing HPC solutions, software integration, and system administration for new clients.

Qualifications

  • Experience in Linux system administration in mission-critical environments, especially in HPC or AI.
  • Experience with AI workload environments, including storage integration and troubleshooting (SAN, etc.).
  • Experience with NGFW appliances from Fortinet, Juniper, Cisco, Checkpoint, and Sophos.
  • Advanced English proficiency (oral and written).
  • Experience with schedulers such as SLURM, LSF, PBS, etc.
  • Knowledge of Kubernetes and/or Docker.
  • N2 and N3 service experience, working across countries.
  • Experience troubleshooting connectivity incidents using testing tools.
  • Availability for working hours.

Differentials

  • Certifications like RHCSA, RHCE, LFCS, Docker Certified Associate, CKA.
  • Knowledge of Fortigate, CUDA, JuniperOS, EMC OS, Cumulus OS, Docker, Kubernetes, Infiniband MELLANOX, NVIDIA DGX (A100, H100, GB200), NVIDIA Base Command Manager (v10, v11).

Equinix is an equal opportunity employer. All candidates will be considered for employment regardless of race, color, religion, creed, national or ethnic origin, ancestry, place of birth, citizenship, sex, pregnancy/childbirth or related medical conditions, sexual orientation, gender identity or expression, marital status or partnership, age, veteran or military status, physical or mental disability, medical condition, political/organizational affiliation, or any other status protected by law.

We are committed to an inclusive employment process. If you need assistance or accommodation, please complete this form.

Obtém a tua avaliação gratuita e confidencial do currículo.
ou arrasta um ficheiro em formato PDF, DOC, DOCX, ODT ou PAGES até 5 MB.