Enable job alerts via email!

Lead Infrastructure Engineer - Network Operation

JPMORGAN CHASE BANK, N.A.

Singapore

On-site

SGD 95,000 - 130,000

Full time

3 days ago
Be an early applicant

Job summary

A leading financial services firm in Singapore is looking for a Lead Infrastructure Engineer to enhance infrastructure design and performance. You will be part of a high-performing team, implementing infrastructure and automation solutions, and addressing complex technical challenges. Candidates should have a Bachelor's degree in Computer Science, at least 5 years of site reliability engineering experience, and proficiency with various observability and automation tools. This position emphasizes collaboration and innovation in a dynamic environment.

Qualifications

  • 5+ years of site reliability engineering or related experience.
  • 3+ years of network engineering experience.
  • Strong ability to present information logically and compellingly.

Responsibilities

  • Guide and assist in building appropriate level designs.
  • Collaborate to design and implement deployment approaches.
  • Implement infrastructure and configuration as code.
  • Collaborate with stakeholders to resolve complex problems.
  • Use service level objectives to proactively resolve issues.
  • Improve network product reliability related nonfunctional requirements.

Skills

Site reliability engineering
Network engineering
Observability tools (Grafana, Prometheus)
Infrastructure automation (Ansible, Terraform)
Problem-solving

Education

Bachelor’s Degree in Computer Science

Tools

Grafana
Dynatrace
Prometheus
Splunk
AppDynamics
Ansible
Terraform
AppDynamics
Job description

Assume a vital position as a key member of a high-performing team that delivers infrastructure and performance excellence. Your role will be instrumental in shaping the future at one of the world's largest and most influential companies.

As a Lead Infrastructure Engineer at JPMorgan Chase within the Infrastructure Platform Team, you apply deep knowledge of software, applications, and technical processes within the infrastructure engineering discipline. Continue to evolve your technical and cross-functional knowledge outside of your aligned domain of expertise.

Job responsibilities
  • Guides and assists others in the areas of building appropriate level designs and gaining consensus from peers where appropriate
  • Collaborates with other software engineers and teams to design and implement deployment approaches using automated continuous integration and continuous delivery pipelines
  • Implements infrastructure, configuration, and network as code for the applications and platforms in your remit
  • Collaborates with technical experts, key stakeholders, and team members to resolve complex problems
  • Understands service level indicators and utilizes service level objectives to proactively resolve issues before they impact customers
  • Improve aspects of network products related to reliability related nonfunctional requirements such as logging, monitoring, observability, performance, scalability, capacity, resiliency, etc.
  • Perform research and discovery on industry tools and lead build versus buy
  • Collaborate with other network and software engineering teams to automate processes, reduce toil and modernize operations
  • Participate in on-call rotation as an escalation contact for production issues
  • Turn theory into practice, navigate through ambiguity to build a plan
  • Accomplish common goals using SCRUM practices
Required qualifications, capabilities, and skills
  • Bachelor’s Degree in Computer Science, Engineering, Mathematics or other related disciplines
  • Minimally 5 years of site reliability engineering or related experience
  • Minimally 3 years of network engineering or related experience
  • Ability to contribute to large and collaborative teams by presenting information in a logical and timely manner with compelling language and limited supervision
  • Ability to proactively recognize road blocks and demonstrates interest in learning technology that facilitates innovation
  • Experience in observability such as white and black box monitoring, service level objective alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, and others
  • Familiarity with troubleshooting common networking technologies and issues
  • Experience with one or more application performance management technologies (AppDynamics, Dynatrace, Riverbed SteelCentral, Prometheus)
  • Ability to initiate and implement ideas to solve business problems
  • Experience triaging and diagnosing issues in complex distributed architectures leveraging infrastructure and application telemetry
  • Experience with one or more infrastructure automation technologies (Ansible, Terraform, Puppet, building APIs and services using REST, SOAP, etc.)
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.