Enable job alerts via email!

Platform Engineer IV - Disaster Recovery, Resiliency and Chaos Engineering

Capital Group

Irvine (CA)

On-site

USD 153,000 - 247,000

Full time

15 days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

A leading company in the finance sector is seeking a Platform Engineer IV specializing in Disaster Recovery, Resiliency, and Chaos Engineering. This role involves designing automation scripts, analyzing recovery trends, and mentoring team members. The ideal candidate will have hands-on experience with AWS tools and proficiency in programming languages such as Python and JavaScript. Competitive salary and benefits are offered, including performance bonuses and a retirement plan.

Benefits

Performance bonuses
Profit sharing
Retirement plan with 15% employer contribution

Qualifications

  • Hands-on experience with AWS recovery tools.
  • Proficiency in modern programming languages like Python and JavaScript.
  • Experience with automation and deployment tools.

Responsibilities

  • Design and implement automation scripts for disaster recovery.
  • Analyze recovery trends and improve platform recovery health.
  • Lead disaster recovery exercises and post-mortem analyses.

Skills

AWS recovery tools
Infrastructure as Code (IaC)
Python
JavaScript
Terraform
Chef
Puppet
Ansible
Communication
Analytical skills
Problem-solving skills

Job description

Platform Engineer IV - Disaster Recovery, Resiliency and Chaos Engineering

Join to apply for the Platform Engineer IV - Disaster Recovery, Resiliency and Chaos Engineering role at Capital Group.

This role involves partnering with infrastructure and application teams to design and implement automation scripts, templates, and workflows for disaster recovery, including resiliency elements such as provisioning, scaling, configuration management, monitoring, and testing.

You will analyze recovery and resiliency trends, collaborate to improve platform and application recovery health, and evaluate resiliency readiness. The role requires defining performance metrics, designing monitoring tools, and developing dashboards for observability.

You will also be responsible for continuous learning, mentoring team members, standardizing resiliency requirements for vendors, and leading disaster recovery exercises and post-mortem analyses.

Qualifications include:
  • Hands-on experience with AWS recovery tools (e.g., Resiliency Hub, Fault Injector Service)
  • Knowledge of Infrastructure as Code (IaC) and automation tools
  • Proficiency in modern programming languages (e.g., Python, JavaScript, Terraform)
  • Experience with automation and deployment tools (e.g., Chef, Puppet, Ansible, Terraform)
  • Strong communication, analytical, and problem-solving skills
  • Experience delivering cross-team projects
Additional details:

Base salary in Southern California: $153,965 - $246,344. Benefits include performance bonuses, profit sharing, and a retirement plan with 15% employer contribution.

Employment details:
  • Level: Mid-Senior
  • Type: Full-time
  • Function: Engineering and IT

We are an equal opportunity employer, committed to diversity and inclusion.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.