Senior Site Reliability Engineer - AWS Kubernetes

Source Technology

London

On-site

GBP 70,000 - 90,000

Full time

Yesterday

Job summary

A leading global financial services provider seeks a Senior Site Reliability Engineer to establish a new team. This role involves designing and managing infrastructure solutions, ensuring scalability and performance, and optimizing cloud costs. Candidates should have extensive cloud experience and proficiency in automation and monitoring tools.

Qualifications

  • Proven experience managing and optimizing a diverse infrastructure stack.
  • Extensive knowledge of cloud platforms and infrastructure as code.
  • Strong understanding of disaster recovery and business continuity planning.

Responsibilities

  • Design, implement, and manage robust infrastructure solutions.
  • Ensure reliability, scalability, and performance of infrastructure.
  • Optimize cloud infrastructure costs.

Skills

Cloud Platforms
Infrastructure as Code
Containerization
Network Protocols
Scripting
Monitoring Tools

Tools

Terraform
AWS CloudWatch
Grafana
Docker
Kubernetes
Wireshark

Job description

Senior Site Reliability Engineer - AWS Kubernetes

A truly unique opportunity to help launch a brand-new team within a global financial services provider. This new team of highly skilled Full Stack Infrastructure Engineers will cover Compute, Storage, Network and Cloud technologies. You will help design, implement, and manage robust infrastructure solutions, ensuring reliability, scalability, and performance.

Requirements:

  • Proven experience managing and optimizing a diverse infrastructure stack.
  • Extensive knowledge of cloud platforms (AWS, Azure, GCP) and infrastructure as code (Terraform, CloudFormation).
  • Familiarity with service mesh technologies (Istio, Linkerd).
  • Solid understanding of virtualization (VMware, Hyper-V), and of containerization and orchestration (Docker, Kubernetes).
  • Understanding of storage solutions (SAN, NAS, cloud storage) and backup systems.
  • Strong understanding of network protocols, routing, switching, and firewalls.
  • Experience with load balancers (F5, HAProxy, Nginx) and network monitoring tools.
  • Experience in DNS management and troubleshooting.
  • Experience in network security best practices.
  • Proficiency in monitoring and observability tools (Prometheus, Grafana, Splunk).
  • Proficiency in at least one scripting language (Python, Bash) for automation.
  • Experience with CI/CD pipeline management and DevOps practices.
  • Strong understanding of disaster recovery and business continuity planning.
  • Experience with performance tuning and capacity planning.
  • Understanding of chaos engineering principles and practices.
  • Skills in cost optimization for cloud infrastructure.

Specific Tools and Techniques:
  • Experience in using cloud-native monitoring tools like AWS CloudWatch, Azure Monitor, and Google Cloud Operations Suite.
  • Experience with packet capture tools like Wireshark for troubleshooting network issues.
  • Experience in using traceroute utilities and performance analysis tools like perf for identifying and resolving bottlenecks.
  • Familiarity with tools such as ipconfig/ifconfig for viewing network configurations, flushing DNS, and diagnosing network issues.
  • Experience with SNMP-based tools for network device monitoring and performance management.
  • Experience in using NetFlow for network traffic analysis.
  • Experience with tools like iostat, vmstat, and dstat for monitoring storage and system performance.
  • Experience in tools like df, du, lsblk, and fdisk for managing and troubleshooting file systems and disk partitions.
  • Familiarity with tools like Prometheus and Grafana for monitoring and observability.

  • Seniority level: Not Applicable
  • Employment type: Full-time
  • Job function: Information Technology
  • Industries: Computer and Network Security
