Enable job alerts via email!

Principal Cloud Reliability Engineer

The Hartford

Hartford (CT)

Hybrid

USD 163,000 - 245,000

Full time

10 days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An established industry player seeks a Principal Cloud Reliability Engineer to enhance cloud automation and reliability across diverse services. This role emphasizes building and optimizing infrastructure capabilities, ensuring application availability, and implementing IT security measures. You will foster an engineering culture focused on automation, driving continuous improvement and operational efficiency. The ideal candidate will possess extensive experience in cloud engineering and automation tools, with a passion for innovation and leadership. Join this dynamic team and make a significant impact in shaping the future of cloud services.

Qualifications

  • 8+ years in engineering and operations with innovation and leadership.
  • Strong cloud engineering expertise with public cloud providers.
  • Experience with automation and performance tools.

Responsibilities

  • Build reliability engineering and automation across 200+ services.
  • Support Operations, RE, DevSecOps, and Middleware technologies.
  • Conduct market research on emerging technology trends.

Skills

Cloud Engineering
Automation
Troubleshooting
Critical Thinking
Agile Methodologies

Education

Master’s degree in Computer Science

Tools

Ansible
Terraform
Dynatrace
Splunk
CloudWatch
CloudTrail
GitHub
Rally
SonarQube

Job description

Join to apply for the Principal Cloud Reliability Engineer role at The Hartford

We’re determined to make a difference and are proud to be an insurance company that goes well beyond coverages and policies. Working here means having every opportunity to achieve your goals – and to help others accomplish theirs, too. Join our team as we help shape the future.

The Hartford’s Cloud Services team is seeking an experienced and highly motivated principal engineer responsible for driving Reliability Engineering for multiple cloud services. The principal engineer will build, optimize, and maintain cloud automation capabilities to enable infrastructure provisioning, application availability, testing, quality, deployment, resiliency, recovery, and efficiency of IT applications and platforms.

This role will also ensure the implementation of IT Security and service hardening requirements. Key success measures include service reliability (availability, latency, quality), technical debt reduction, and cost efficiency.

This position offers a Hybrid work schedule, requiring 3 days a week in the Hartford, CT or Charlotte, NC office. Candidates must be authorized to work in the US without sponsorship. The company does not support STEM OPT I-983 Training Plan endorsement for this role.

Responsibilities
  1. Build reliability engineering, automation, and quality capabilities across 200+ infrastructure services within our Cloud transformation landscape.
  2. Support Operations, RE, DevSecOps, Quality, and Middleware technologies.
  3. Develop tools and capabilities for software engineering teams to optimize development, improve technology, and increase efficiency.
  4. Foster an engineering culture emphasizing automation across our technology stack and application architectures, enhancing developer experience and IT productivity.
  5. Support enterprise cloud needs by improving Performance, Scalability, Resiliency, Reliability, Stability, Observability, and Security, while continuously modernizing services to boost developer productivity, automation, quality, and operational costs.
  6. Achieve annual cloud optimization targets and manage error budgets effectively.
  7. Enable support functions to 'shift left' from Infrastructure to application teams.
  8. Conduct market research on emerging technology trends and secure development practices.
  9. Lead change management initiatives to promote automation adoption and cultivate a culture of continuous learning and improvement.
Qualifications
  1. 8+ years of experience in technical roles involving engineering, application management, and operations, with a proven record of innovation and leadership in diverse teams.
  2. Strong cloud engineering expertise across public cloud providers and managing highly reliable, automated environments.
  3. Proven ability to develop, mature, and deliver reliability engineering tools and capabilities.
  4. Deep knowledge of cloud product management, engineering, and Agile methodologies.
  5. Experience with automation tools like Ansible and Terraform.
  6. Experience with performance and observability tools such as Dynatrace, Splunk, CloudWatch, and CloudTrail.
  7. Familiarity with CI/CD and DevOps tools like GitHub, Rally, SonarQube.
  8. Strong troubleshooting, issue-resolution, and root cause analysis skills.
  9. Experience maintaining cloud-based and on-prem automation tools across various service models.
  10. Ability to act as a strategic thought leader and credible business partner, with strong critical thinking skills.
  11. Master’s degree in Computer Science or related field preferred.
Compensation

The annual base pay range is $163,040 - $244,560, subject to factors like performance and competencies. Compensation may include bonuses, incentives, and recognition beyond base salary.

Additional Details
  • Seniority level: Mid-Senior level
  • Employment type: Full-time
  • Job function: Engineering and IT
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Lead Site Reliability Engineer - Cloud Platforms

Jobot

Evansville

Remote

USD 160,000 - 200,000

Yesterday
Be an early applicant

Principal SRE (Site Reliability Engineer) - Remote

SailPoint

Remote

USD 176,000 - 252,000

Yesterday
Be an early applicant

Principal Site Reliability Engineer

Lumen Argentina

Aurora

Remote

USD 156,000 - 209,000

Today
Be an early applicant

Lead Site Reliability Engineer - Cloud Platforms

Jobot

Minneapolis

Remote

USD 160,000 - 200,000

Yesterday
Be an early applicant

Lead Site Reliability Engineer - Cloud Platforms

Jobot

Hammond

Remote

USD 160,000 - 200,000

Yesterday
Be an early applicant

Lead Site Reliability Engineer - Cloud Platforms

Jobot

Atlanta

Remote

USD 160,000 - 200,000

Yesterday
Be an early applicant

Lead Site Reliability Engineer - Cloud Platforms

Jobot

Memphis

Remote

USD 160,000 - 200,000

Yesterday
Be an early applicant

Manager Site Reliability Engineer ServiceNow

NBCUniversal

Englewood Cliffs

Remote

USD 140,000 - 175,000

Yesterday
Be an early applicant

Manager, Site Reliability Engineer (ServiceNow)

NBC Universal

Englewood Cliffs

Remote

USD 140,000 - 175,000

Yesterday
Be an early applicant