SRE - Kubernetes

TN Ireland

Dublin

On-site

EUR 70,000 - 110,000

Full time

12 days ago

Job summary

An established industry player is seeking a Principal Kubernetes Site Reliability Engineer to join their innovative Digital Assets Technical Operations Team. In this pivotal role, you will oversee the design of a robust, cloud-based deployment on AWS’s Kubernetes Platform, ensuring high availability and security across multiple regions. Your expertise in AWS and Kubernetes will be key in crafting resilient infrastructures and leading a team of skilled engineers. This is a fantastic opportunity to work in a dynamic environment where your contributions will directly impact the company's success in the financial services sector. If you thrive in fast-paced settings and are passionate about cloud technologies, this role is perfect for you.

Qualifications

  • Several years of hands-on experience with AWS in a production environment.
  • Production experience running Kubernetes workloads on AWS using EKS.

Responsibilities

  • Design a multi-region, highly available cloud-based deployment on AWS's Kubernetes Platform.
  • Provide technical leadership to teams of Site Reliability Engineers.

Skills

AWS
Kubernetes
Helm
AWS CloudFormation
Monitoring tools (CloudWatch, Datadog, Splunk)
Unix operating systems
Python
CDN Providers (Akamai)
Agile software development

Tools

AWS EKS
CloudWatch
Datadog
Splunk
Kibana

Job description

A global investment/financial services company is looking to hire an experienced Principal Kubernetes Site Reliability Engineer as part of its Digital Assets Technical Operations Team. You will work with various engineering teams to own the design of a new multi-region, highly available, cloud-based deployment of our applications to AWS’s Kubernetes Platform (EKS).

Experience

  • Several years of hands-on experience with AWS in a production environment
  • Production experience running Kubernetes workloads on AWS using EKS
  • Experience creating and deploying Helm charts & libraries
  • Specialist in AWS CloudFormation, IAM, VPC and network security
  • Experience with monitoring tools, e.g. CloudWatch, Datadog, Splunk
  • Proficiency with Unix operating systems and shell scripting
  • Programming experience, e.g. Python, preferred
  • Experience with CDN providers, e.g. Akamai, preferred
  • Experience with the agile software development lifecycle preferred

Skills

  • Experience with Amazon Web Services (AWS), having managed services and applications in a large AWS cross-account environment using IAM and federated SSO
  • Experience crafting and maintaining logging, monitoring, and alerting capabilities using tools like Datadog, Splunk, and Kibana
  • See problems as opportunities to automate
  • Ability to work independently with minimal direction
  • Coordinate the overall design of highly available, secure, scalable microservices-based applications in AWS
  • Track record of providing technical leadership to strong teams of Site Reliability Engineers
  • Experience with configuring and deploying resilient infrastructure in multiple regions and multiple availability zones
  • Work cross-functionally with other organizations and collaborate with our risk, product and engineering team leaders
  • Ability to communicate at all levels with a track record of strong written and verbal communications

To apply and find out more please reach out.