Enable job alerts via email!

Lead Infrastructure Engineer - Network Operation

JPMorgan Chase & Co.

Singapore

On-site

SGD 100,000 - 130,000

Full time

30+ days ago

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A leading global financial institution is looking for a Lead Infrastructure Engineer to join their Infrastructure Platform Team in Singapore. The ideal candidate will have a strong technical background in site reliability and network engineering, with experience in automation tools and observability. Responsibilities include implementing infrastructure as code, resolving complex problems, and collaborating with cross-functional teams to enhance network reliability. This is a critical position that ensures high performance and excellence in service delivery.

Qualifications

Minimum 5 years of site reliability engineering or related experience.
Minimum 3 years of network engineering or related experience.
Familiarity with troubleshooting common networking technologies and issues.

Responsibilities

Guide and assist others in building appropriate level designs.
Implement infrastructure and network as code for applications.
Collaborate with teams to design and implement deployment approaches.

Skills

Site reliability engineering

Network engineering

Observability

Collaboration

Problem-solving

Education

Bachelor’s Degree in Computer Science, Engineering, Mathematics or related disciplines

Tools

Grafana

Dynatrace

Prometheus

Ansible

Terraform

Assume a vital position as a key member of a high-performing team that delivers infrastructure and performance excellence. Your role will be instrumental in shaping the future at one of the world's largest and most influential companies.

As a Lead Infrastructure Engineer at JPMorgan Chase within the Infrastructure Platform Team, youapply deep knowledge of software, applications, and technical processes within the infrastructure engineering discipline. Continue to evolve your technical and cross-functional knowledge outside of your aligned domain of expertise.

Job responsibilities

Guides and assists others in the areas of building appropriate level designs and gaining consensus from peers where appropriate
Collaborates with other software engineers and teams to design and implement deployment approaches using automated continuous integration and continuous delivery pipelines
Implements infrastructure, configuration, and network as code for the applications and platforms in your remit
Collaborates with technical experts, key stakeholders, and team members to resolve complex problems
Understands service level indicators and utilizes service level objectives to proactively resolve issues before they impact customers
Improve aspects of network products related to reliability related nonfunctional requirements such as logging, monitoring, observability, performance, scalability, capacity, resiliency, etc.
Perform research and discovery on industry tools and lead build versus buy
Collaborate with other network and software engineering teams to automate processes, reduce toil and modernize operations
Participate in on-call rotation as an escalation contact for production issues
Turn theory into practice, navigate through ambiguity to build a plan
Accomplish common goals using SCRUM practices

Required qualifications, capabilities, and skills

Bachelor’s Degree in Computer Science, Engineering, Mathematics or other related disciplines
Minimally 5 years of site reliability engineering or related experience
Minimally 3 years of network engineering or related experience.
Ability to contribute to large and collaborative teams by presenting information in a logical and timely manner with compelling language and limited supervision
Ability to proactively recognize road blocks and demonstrates interest in learning technology that facilitates innovation
Experience in observability such as white and black box monitoring, service level objective alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, and others
Familiarity with troubleshooting common networking technologies and issues
Experience with one or more application performance management technologies (AppDynamics, Dynatrace, Riverbed SteelCentral, Prometheus)
Ability to initiate and implement ideas to solve business problems
Experience triaging and diagnosing issues in complex distributed architectures leveraging infrastructure and application telemetry
Experience with one or more infrastructure automation technologies (Ansible, Terraform, Puppet, building APIs and services using REST, SOAP, etc.)

Get your free, confidential resume review.

or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Top companies

Top positions