Aktiviere Job-Benachrichtigungen per E-Mail!
Erhöhe deine Chancen auf ein Interview
Erstelle einen auf die Position zugeschnittenen Lebenslauf, um deine Erfolgsquote zu erhöhen.
An established industry player is seeking a Site Reliability Engineer to lead operations for the European Sovereign Cloud initiative. This role involves collaborating with technology leaders, enhancing AWS services, and ensuring high availability for EU customers. You will tackle production system challenges, implement improvements, and participate in on-call rotations. Join a dynamic team dedicated to innovation and operational excellence, where your contributions will directly impact the future of cloud services in Europe. If you're passionate about cloud operations and eager to make a difference, this opportunity is for you!
You will need to login before you can apply for a job.
Sector: Operations and Facilities Management
Contract Type: Permanent
Hours: Full Time
DESCRIPTION
AWS is set to introduce the inaugural European Sovereign Cloud (ESC), marking a significant development in Utility Computing (UC). To spearhead this initiative, we are actively seeking experienced systems development engineers with a strong background in cloud operations. As part of the AWS Managed Operations team, you will play a pivotal role in building and leading operations and development teams dedicated to delivering high-availability AWS services, including EC2, S3, Dynamo, Lambda, and Bedrock, exclusively for EU customers.
Key job responsibilities include overseeing the launch of the ESC in 2025, working closely with global AWS teams, and influencing the evolution of AWS services and technology. A typical day in this role involves collaborating with technology leaders, contributing to the enhancement of day-to-day operations, and ensuring improvements in availability, reliability, latency, performance, and efficiency of the ESC.
A day in the life involves operating production systems and working with your team to make long-term improvements to the reliability, availability, and performance of those software systems. You will root cause why some deployments recently failed and fix those bugs. You may also execute highly sensitive time-critical changes to production and participate in design discussions, writing code, and providing actionable feedback to your team's code.
You will be required to occasionally participate in "on-call" rotations to resolve incidents occurring out-of-hours.
Eligibility requirements
Amazon is an equal opportunities employer. We believe passionately that employing a diverse workforce is central to our success. We make recruiting decisions based on your experience and skills.