Aktiviere Job-Benachrichtigungen per E-Mail!

Site Reliability Engineer

ZipRecruiter

Hamburg

Remote

EUR 50.000 - 70.000

Vollzeit

Vor 17 Tagen

Erstelle in nur wenigen Minuten einen maßgeschneiderten Lebenslauf

Überzeuge Recruiter und verdiene mehr Geld. Mehr erfahren

Starte ganz am Anfang oder importiere einen vorhandenen Lebenslauf

Zusammenfassung

A leading manufacturer of semiconductor equipment is seeking a skilled engineer to enhance the reliability of their distributed computing systems. Key responsibilities include troubleshooting, improving system resilience, and contributing to product development while collaborating with teams globally. Ideal candidates should have practical experience in distributed systems and a strong automation background.

Qualifikationen

  • Practical knowledge of distributed computing systems.
  • Experience with build and release tools.
  • Proficiency in at least one scripting language (Python).
  • Expertise in Linux.

Aufgaben

  • Create awareness of methods and procedures to prevent repetitive help requests.
  • Assist application developers in understanding the infrastructure.
  • Improve the functionality and reliability of the VCP.

Kenntnisse

Automated testing
CI/CD pipelines
Networking issues
System resilience

Tools

Maven
Nexus
Bamboo
Github
Ansible

Jobbeschreibung

Job Description

Our client is one of the world’s leading manufacturers of semiconductor chip-making equipment. A majority of the world’s microchips receive their critical lithographic patterning in machines made by this organisation. In addition, they produce metrology tools and advanced applications to analyze and optimize the performance of the customer production process.

Job Mission

Troubleshoot short-term problems and translate, develop into structural improvements on our distributed data and compute platform infrastructure. Be accurate, be precise and help drive up the aggregate availability of the installs of these distributed computing systems in Korea, Taiwan, Israel, China, and the US. Be part of the computing platform that is a main pillar in the production of next-generation microchips for companies like Apple, Samsung, and others.

Responsibilities:

  • Create awareness in other teams about methods and procedures we use to help them prevent repetitive help requests.
  • Assist application developers in understanding the infrastructure, clusters, and systems.
  • Understand and explain how the system fits into the customer’s ecosystem.
  • Share knowledge and mindset with other teams (dev/infra engineers).
  • Contribute to building VCP as a product that meets quality standards.
  • Increase stability and reliability of VCP through automated testing and automation.
  • Enhance customer satisfaction and product reliability.
  • Improve the functionality and reliability of VCP.
  • Translate customer ecosystem needs into engineering deliverables.
  • Identify and resolve system/cluster-level issues.
  • Combine individual stories into a comprehensive solution.
  • Make VCP reliable by improving system resilience, bug fixing, and structural improvements.
  • Resolve bugs sustainably by implementing regression tests and structural fixes.
  • Manage component lifecycle predictably.
  • Maintain the technical roadmap (application lifecycle management).
  • Support feature and service requests from the field.
  • Suggest and implement improvements to technical solutions and workflows, aligned with team and stakeholder needs.

Highly valued qualifications & experiences:

  • Experience with DC/OS.
  • Experience with zero-downtime technology introduction and data migration.
  • Passion for automated testing and CI/CD pipelines.
  • Deep understanding of networking issues.
  • Willingness to work remotely outside regular hours when necessary to build fail-safe systems.

Required qualifications & experiences:

  • Practical knowledge of distributed computing systems.
  • Experience with build and release tools: Maven, Nexus, Bamboo, Github.
  • Proficiency in at least one scripting language (Python).
  • Experience with Ansible.
  • Expertise in Linux.
Hol dir deinen kostenlosen, vertraulichen Lebenslauf-Check.
eine PDF-, DOC-, DOCX-, ODT- oder PAGES-Datei bis zu 5 MB per Drag & Drop ablegen.