Enable job alerts via email!

Staff Cloud Operations Engineer – Monitoring Lead (9810)

Extreme Networks, Inc.

Ontario

On-site

CAD 100,000 - 130,000

Full time

Today
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

A leading technology company is seeking a Staff Cloud Operations Engineer – Monitoring Lead to enhance their monitoring and alerting strategies across cloud infrastructure. This role involves significant responsibility, including tool evaluation, automation scripting, and providing ongoing support, ideal for someone with extensive experience in cloud operations and a passion for innovation.

Qualifications

  • 8+ years of experience in Cloud Operations, DevOps, or SRE roles.
  • Deep expertise with AWS, Azure, or GCP.
  • Knowledge of container-based architectures and monitoring tools.

Responsibilities

  • Lead design and improvement of monitoring frameworks for cloud infrastructure.
  • Develop automation scripts for monitoring deployment.
  • Provide 24/7 support for cloud services.

Skills

Problem-solving
Troubleshooting
Cloud Operations
Leadership
Monitoring

Education

BS degree in Computer Science, Engineering, or related field

Tools

Prometheus
Grafana
Datadog
Splunk
CloudWatch
Azure Monitor
GCP Operations Suite
Docker
Kubernetes

Job description

There has never been a better time to join Extreme, with several acquisitions extending our portfolio and go-to-market strategy. We have seen enormous opportunity and growth within the region.

Aside from being a Technology Leader in the Gartner Magic Quadrant, we also promote an internal culture that embraces diversity, inclusion, and equality in the workplace. Having Diversity and Inclusion as part of our core values, we’re proud to foster an environment where every Extreme employee can thrive because of their differences, not despite them.

Staff Cloud Operations Engineer – Monitoring Lead

We are seeking a highly skilled and experienced Staff Cloud Operations Engineer – Monitoring Lead to join our Cloud Operations team. In this role, you will be responsible for designing, implementing, and optimizing our monitoring and alerting strategies across our cloud infrastructure and applications. Your work will drive proactive issue identification, system health assurance, and contribute to our operational excellence and reliability goals. We are looking for top talent who are passionate about their work and eager to make a difference.

  • Lead the design, implementation, and continuous improvement of our monitoring and alerting framework for cloud infrastructure (AWS, Azure, GCP), applications, and services.
  • Define KPIs, SLIs, and SLOs for critical systems.
  • Evaluate, select, and integrate monitoring tools (e.g., Prometheus, Grafana, Datadog, Splunk, CloudWatch, Azure Monitor, GCP Operations Suite).
  • Develop automation scripts and tools (e.g., Python, Bash, PowerShell) to streamline monitoring deployment and incident response.
  • Build and maintain dashboards, alerts, and reports for system performance and health insights.
  • Analyze monitoring data to identify performance issues, resource inefficiencies, and cost-saving opportunities.
  • Collaborate with engineering teams to implement performance improvements and cost optimizations.
  • Create and maintain documentation for monitoring systems and procedures.
  • Identify areas for improvement in cloud operations and monitoring capabilities.
  • Provide 24/7 support for cloud services.
  • Participate in cloud security and compliance initiatives.

Ideal Qualifications :

  • BS degree in Computer Science, Engineering, or related field.
  • 8+ years of experience in Cloud Operations, DevOps, or SRE roles, with a focus on monitoring.
  • Deep expertise with at least one major public cloud platform (AWS, Azure, GCP).
  • Proven leadership or senior contributor experience in monitoring roles.
  • Knowledge of container-based architectures (Docker, Kubernetes).
  • Extensive experience with monitoring and observability tools (e.g., Prometheus, Grafana, Datadog, Splunk).
  • Strong problem-solving and troubleshooting skills.
  • Knowledge of Elasticsearch, PostgreSQL, Redis, Ignite, Kafka, and RabbitMQ.
  • Comfortable working across multiple time zones within a distributed team.
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Staff Cloud Operations Engineer – Monitoring Lead (9810)

Extreme Networks, Inc.

null null

On-site

On-site

CAD 100,000 - 140,000

Full time

16 days ago

Cloud Operations Engineer â Monitoring Lead (9394)

Extreme Networks

Toronto null

Hybrid

Hybrid

CAD 120,000 - 140,000

Full time

Today
Be an early applicant

Staff Security Operations Engineer

Canonical

Trois-Rivières null

Remote

Remote

CAD 100,000 - 130,000

Full time

Yesterday
Be an early applicant

Staff Security Operations Engineer

Canonical

Moncton null

Remote

Remote

CAD 90,000 - 130,000

Full time

Yesterday
Be an early applicant

Cloud Operations Engineer – Monitoring Lead (9394)

Extreme Networks

Toronto null

Hybrid

Hybrid

USD 120,000 - 140,000

Full time

Yesterday
Be an early applicant

Cloud Operations Engineer – Monitoring Lead (9394)

Extreme Networks, Inc.

Toronto null

Hybrid

Hybrid

CAD 120,000 - 140,000

Full time

Yesterday
Be an early applicant

Manager, Security Monitoring and Response - Payments Canada.

Hack The Box

Ottawa null

On-site

On-site

CAD 112,000 - 141,000

Full time

7 days ago
Be an early applicant

Portfolio Monitoring Analytics Lead

Pinnacle Enterprise Risk Consulting Services, LLC

Toronto null

On-site

On-site

CAD 100,000 - 130,000

Full time

11 days ago

Monitoring Solutions & Equipment – Domain Lead, Mining

Stonewood Group Inc.

Ottawa null

On-site

On-site

CAD 100,000 - 150,000

Full time

11 days ago