Enable job alerts via email!

DevOps Engineer II - Auth0

Tek Ninjas

Ontario

Remote

CAD 80,000 - 120,000

Full time

Yesterday
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

A leading company is seeking an Observability Engineer to enhance monitoring and observability for their platform. This remote role focuses on maintaining and automating observability tooling, ensuring uptime and stability while collaborating with engineering teams. Ideal candidates will have experience in SRE or DevOps and a passion for observability tooling.

Qualifications

  • 3+ years of platform operations engineering, SRE, or DevOps experience.
  • Proficiency in Golang, Node.js, or Python.
  • Demonstrable expertise in monitoring distributed applications at scale.

Responsibilities

  • Run services in production environments and design for high growth.
  • Provision, configure, and monitor cloud-native infrastructure.
  • Automate operational tasks and improve scripts.

Skills

Monitoring
Observability
Automation
Collaboration

Tools

AWS
Google Cloud
Azure
Datadog
Sentry
Terraform
Kubernetes

Job description

100% remote (anywhere in Canada)

The Auth0 Platform Observability team owns the observability tooling that monitors the CIC Platform, and we are looking for an Observability Engineer to help ensure that our Product and Platform Engineers can monitor and observe our platform while continuing to rapidly ship software that our customers love.

If you have experience within the Site Reliability Engineering (SRE) field or working as a Development Operations (DevOps) engineer, and you have a passion for Observability tooling, this position will allow you to further your learning and development in these areas.

We are looking for engineers who are passionate about monitoring, observing, measuring uptime and availability, and ensuring stability for our platform. Our engineers maintain and automate observability tooling for our entire platform, including metrics, logs, and traces.

Responsibilities :
  1. Proficient in running services in production environments.
  2. Contribute to the process of designing services for high growth and high availability.
  3. Provision, configure, and monitor cloud-native infrastructure and services.
  4. Automate key processes. You might work on :
  5. Troubleshooting performance issues and operational issues.
  6. Automating operational tasks and improving scripts.
  7. Assisting with and providing feedback for performance testing and automation.
  8. Provisioning infrastructure and services in collaboration with product engineers.
  9. Collaborating with other engineering teams to help enhance their observability.
Qualifications :
  1. 3+ years of platform operations engineering, SRE, or DevOps experience.
  2. Experience with cloud infrastructure like AWS, Google Cloud, or Azure.
  3. Experience with Datadog (preferred) or other monitoring tools.
  4. Experience with Sentry (preferred) or other error reporting tools.
  5. Experience managing infrastructure with Terraform.
  6. Proficiency in Golang, Node.js, or Python.
  7. Demonstrable expertise in monitoring distributed applications at scale.
  8. Understanding of microservice architecture and best practices.
  9. Experience with Kubernetes.
  10. Team player who is willing to voice their opinion.
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.