Site Reliability Engineer

InterEx Group

Dortmund

Remote

EUR 60,000 - 100,000

Full-time

Posted 8 days ago

Summary

Join a forward-thinking company at the forefront of semiconductor technology, where you will play a pivotal role in enhancing the reliability of distributed computing systems. This position involves troubleshooting, implementing automated testing, and collaborating with diverse teams to ensure seamless operations. You will contribute to the development of next-generation microchips for leading global brands. If you're passionate about innovation and eager to make a significant impact, this opportunity is perfect for you.

Qualifications

  • Practical knowledge of distributed computing systems and experience with data centers.
  • Familiarity with CI/CD pipelines and build/release infrastructure.

Responsibilities

  • Enhance VCP reliability and improve system resilience through automation.
  • Translate customer needs into engineering deliverables and manage the technical roadmap.

Skills

Distributed Computing Systems
Automated Testing
Troubleshooting Networking Issues
Scripting (Python)

Tools

Maven
Nexus
Bamboo
GitHub

Job Description

Our client is one of the world’s leading manufacturers of semiconductor chip-making equipment. A majority of the world’s microchips receive their critical lithographic patterning in machines made by this organisation. In addition, they produce metrology tools and advanced applications to analyze and optimize the performance of the customer production process.

Job Mission

Troubleshoot short-term problems and turn the findings into structural improvements on our distributed data and compute platform infrastructure. Be accurate, be precise, and help drive up the aggregate availability of the installations of these distributed computing systems in Korea, Taiwan, Israel, China, and the US. Be part of the computing platform that is one of the main pillars supporting the production of the next-generation microchips of Apple, Samsung, and many others.

Responsibilities:
  1. Make other teams aware of the methods and procedures we use, so they can avoid repetitive help requests.
  2. Help application developers understand the infrastructure, cluster, and system.
  3. Understand and explain how the system fits into the customer’s ecosystem.
  4. Share knowledge and mindset with other teams (dev/infra engineers).
  5. Contribute towards building VCP as a product that meets our quality standards.
  6. Increase stability and reliability of VCP through automated testing and automation.
  7. Enhance customer satisfaction and product reliability.
  8. Improve the functionality and reliability of VCP.
  9. Translate customer ecosystem needs into engineering deliverables.
  10. Identify and resolve system and cluster-level issues.
  11. Integrate individual stories into a comprehensive solution.
  12. Enhance VCP reliability by improving system resilience, including bug fixing and structural improvements.
  13. Implement regression tests and structural fixes to resolve bugs sustainably.
  14. Manage predictable component lifecycle as an ambassador.
  15. Maintain the technical roadmap, including application lifecycle management.
  16. Support feature and service requests from the field.
  17. Suggest and implement improvements to technical solutions and workflows, aligning with team and stakeholder needs.
Highly Valued Qualifications & Experiences:
  1. Experience with data centers and operating systems.
  2. Experience with introducing new technology with zero downtime, including data migration.
  3. Passion for automated testing, qualification, and integration into CI/CD pipelines.
  4. Deep interest in troubleshooting networking issues.
  5. Willingness to work remotely outside regular hours when necessary to build fail-safe systems.
Required Qualifications & Experiences:
  1. Practical knowledge and experience with distributed computing systems.
  2. Experience with build and release infrastructure such as Maven, Nexus, Bamboo, and GitHub.
  3. Familiarity with at least one scripting language, preferably Python.