Enable job alerts via email!

Sr. Service Resiliency Engineer

Genesys

Ontario

On-site

CAD 80,000 - 120,000

Full time

30+ days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An established industry player is seeking a Senior Service Resiliency Engineer to enhance platform reliability at scale. In this hands-on role, you'll analyze system trends, implement innovative resilience strategies, and collaborate with engineering teams to ensure high availability across a microservices architecture. This position offers the opportunity to make a significant impact on user experiences worldwide while fostering a culture of empathy and innovation. If you are passionate about solving complex engineering challenges and driving improvements in system resilience, this is the perfect opportunity for you.

Qualifications

  • 5+ years in software engineering with a focus on resilience and reliability.
  • Strong understanding of microservices and distributed systems is essential.

Responsibilities

  • Analyze system stability and implement resilience patterns to improve reliability.
  • Collaborate with teams to embed reliability into design decisions.

Skills

Python
Go
Java
TypeScript
Analytical Skills
Communication Skills

Tools

AWS
CloudWatch
New Relic
SumoLogic
AWS CloudFormation
Terraform
CI/CD pipelines
OpenTelemetry

Job description

Sr. Service Resiliency Engineer

Apply

Locations: Ontario, Canada
Time Type: Full time
Posted on: Posted 2 Days Ago
Job Requisition ID: JR107509

Genesys empowers organizations of all sizes to improve loyalty and business outcomes by creating the best experiences for their customers and employees. Through Genesys Cloud, the AI-powered Experience Orchestration platform, organizations can accelerate growth by delivering empathetic, personalized experiences at scale to drive customer loyalty, workforce engagement, efficiency, and operational improvements.

We employ more than 6,000 people across the globe who embrace empathy and cultivate collaboration to succeed. And, while we offer great benefits and perks like larger tech companies, our employees have the independence to make a larger impact on the company and take ownership of their work. Join the team and create the future of customer experience together.

Join our Resiliency Engineering team at Genesys, where you’ll drive platform reliability at scale. As a Senior Service Resiliency Engineer, you will analyze system trends, lead resilience initiatives, and shape the architectural direction of our microservices platform. This is a hands-on, high-impact role where you’ll work closely with engineering teams to implement innovative resilience strategies across our global infrastructure.

Key Responsibilities:

  • Analyze system stability trends and identify areas for improvement across distributed services.
  • Implement resilience patterns (e.g., circuit breakers, retries, bulkheads, load shedding) to reduce system failures.
  • Develop and maintain automated tools for service health measurement and static analysis.
  • Guide engineering teams in applying best practices in system resilience and observability.
  • Apply chaos engineering principles to proactively identify weaknesses in the platform.
  • Research and evaluate emerging technologies related to service resilience and high availability.
  • Drive enablement efforts through documentation, standards, and presentations to technical and non-technical audiences.
  • Manage the full development lifecycle for resilience-focused tools and enhancements.
  • Collaborate with platform and service teams to embed reliability into design decisions.
  • Contribute to team-wide initiatives that support Genesys’ innovation and customer trust.

Qualifications:

  • Minimum 5 years of experience in software engineering using Python, Go, Java, or TypeScript.
  • Strong understanding of microservices architecture and distributed systems.
  • Minimum 2 years of hands-on experience with AWS.
  • Proven ability to diagnose and resolve complex systems issues.
  • Proficient with observability tools such as CloudWatch, New Relic, or SumoLogic.
  • Clear and effective communication skills, both technical and non-technical.
  • Data-driven approach to problem-solving with strong analytical skills.
  • Commitment to continuous learning and embracing empathy in technical collaboration.

Preferred Qualifications:

  • Previous experience in SRE or DevOps roles.
  • Experience implementing resilience patterns such as retries, timeouts, and bulkheads.
  • Exposure to chaos engineering or failure injection frameworks.
  • Familiarity with performance testing tools and techniques.
  • Hands-on experience with AWS CloudFormation, Terraform, and CI/CD pipelines.
  • Experience with OpenTelemetry and distributed tracing.
  • Understanding of SLIs, SLOs, and error budgets.
  • Contributions to open-source projects or technical blogs.

Why Join Us?
Be part of a team that’s redefining resilience at Genesys—where your ideas will directly impact the experiences of millions of users worldwide. We foster a culture of empathy, innovation, and collaboration, and we’re looking for leaders who thrive on solving tough engineering problems with creativity and heart.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Sr. Service Resiliency Engineer

Genesys Telecommunications Laboratories, Inc.

Ontario

On-site

CAD 80,000 - 120,000

30+ days ago

Sr. Service Resiliency Engineer

Genesys Cloud Services, Inc.

Ontario

On-site

CAD 100,000 - 130,000

4 days ago
Be an early applicant

Sr. Service Resiliency Engineer

Genesys

Ontario

On-site

CAD 100,000 - 130,000

4 days ago
Be an early applicant

Sr. Service Resiliency Engineer

Genesys Cloud Services, Inc.

Ontario

On-site

CAD 80,000 - 100,000

30+ days ago