Job Search and Career Advice Platform

Enable job alerts via email!

Lead Site Reliability Engineer

JPMorgan Chase & Co.

Glasgow

On-site

GBP 100,000 - 125,000

Full time

30 days ago

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A leading financial institution in Glasgow seeks a Lead Site Reliability Engineer. In this role, you'll lead efforts to enhance application reliability, serve as a key technical advisor, and mentor team members. Your expertise in tools like Docker and Kubernetes, along with experience in programming and data-driven analytics, will be vital for success. Join a dynamic team that values innovation and performance, offering opportunities for professional growth and collaboration across various levels.

Qualifications

  • Fluency in at least one programming language such as Python or Java.
  • Proficiency in continuous integration and continuous delivery tools.
  • Experience with troubleshooting common networking technologies.

Responsibilities

  • Lead initiatives to improve application reliability using data-driven analytics.
  • Act as the main point of contact during major incidents.
  • Document and share knowledge within the organization.

Skills

Problem-solving
Collaboration
Mentorship
Technical expertise

Tools

Grafana
Dynatrace
Docker
Kubernetes
Jenkins
Job description

Assume a critical role in defining the future of a globally recognized firm and have a direct and significant effect in a realm tailored for top achievers in site reliability.

As a Lead Site Reliability Engineer at JPMorgan Chase within Risk Technology Team, you hold a leadership role in your team, demonstrate strong knowledge across multiple technical domains, and advise others on the technical and business issues facing them. Take lead and conduct resiliency design reviews, break up complex problems into digestible work for other engineers, act as a technical lead for medium to large‑sized products, and provide advice and mentoring to other engineers.

Job responsibilities
  • Demonstrates and champions site reliability culture and practices and exerts technical influence throughout your team
  • Leads initiatives to improve the reliability and stability of your team’s applications and platforms using data‑driven analytics to improve service levels
  • Collaborates with team members to identify comprehensive service level indicators and stakeholders to establish reasonable service level objectives and error budgets with customers
  • Demonstrates a high level of technical expertise within one or more technical domains and proactively identifies and solves technology‑related bottlenecks in your areas of expertise
  • Acts as the main point of contact during major incidents for your application and demonstrates the skills to identify and solve issues quickly to avoid financial losses
  • Documents and shares knowledge within your organization via internal forums and communities of practice
Required qualifications, capabilities, and skills
  • Formal training or certification on reliability, scalability, performance, security, enterprise system architecture, toil reduction concepts and proficient advanced experience
  • Fluency in at least one programming language such as Python, Java Spring Boot, Unix Shell.
  • Deep knowledge of software applications and technical processes with emerging depth in one or more technical disciplines
  • Proficiency and experience in observability such as white and black box monitoring, SLO alerting, and telemetry collection using tools such as Grafana, Geneos, Dynatrace, Prometheus, Datadog, Splunk, etc.
  • Proficiency in continuous integration and continuous delivery tools (e.g., Jenkins, GitLab, Terraform, etc.)
  • Experience with container and container orchestration (e.g., ECS, Kubernetes, Docker, etc.)
  • Experience with troubleshooting common networking technologies and issues
  • Ability to identify and solve problems related to complex data structures and algorithms
  • Drive to self‑educate and evaluate new technology and ability to teach new programming languages to team members
  • Ability to expand and collaborate across different levels and stakeholder groups
  • Working knowledge on Apache, Tomcats, TomEE.
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.