Site Reliability Engineer

Conexiom

Vancouver

Remote

CAD 80,000 - 110,000

Full time

Job summary

A leading technology firm is seeking an experienced Site Reliability Engineer to ensure the reliability and scalability of cloud systems. The ideal candidate has expertise in Microsoft Azure and strong automation skills. Responsibilities include designing infrastructure, developing automation tools, and collaborating across teams to improve cloud solutions. This fully remote role offers significant growth opportunities in an inclusive culture.

Benefits

Impactful Work
Growth Opportunity
Remote Flexibility
Inclusive Culture

Qualifications

  • 3+ years in SRE, DevOps, Cloud Engineering, or similar roles.
  • Experience with Azure services like AKS, Functions, SQL, and App Services.
  • Proficient in Kubernetes, Docker, and container orchestration.
  • Experience with Infrastructure as Code tools like Terraform and Ansible.
  • Solid understanding of SRE principles.
  • Strong troubleshooting and communication skills.

Responsibilities

  • Design and maintain scalable infrastructure on Microsoft Azure.
  • Automate operational tasks using PowerShell, Python, or Bash.
  • Monitor system performance with tools like Datadog and Azure Monitor.
  • Collaborate with teams to improve deployment pipelines and CI/CD processes.
  • Drive continuous improvement in system reliability and operational efficiency.

Skills

SRE
DevOps
Cloud Engineering
Kubernetes
Docker
Azure
Automation
Infrastructure as Code
Scripting
Troubleshooting

Education

Bachelor’s degree in computer science or related field

Tools

Terraform
Ansible
Datadog
Azure Monitor

Job description

About Conexiom:

Conexiom empowers manufacturers and distributors to ship more ideal orders—that are accurate, on time, in full, and profitable. Emailed and unstructured sales orders are transformed into digital, touchless transactions with speed and precision by our self-learning AI and automation platform. Innovative AI agents drive continuous performance gains by turning root-cause insights into an automated action plan.


Join the team that leaders like Honeywell, Graybar, and Exxon rely on to fuel profitable growth, build stronger customer relationships, and become the employer of choice. Conexiom is backed by leading growth capital partners Luminate Capital Partners, Warburg Pincus, and ICONIQ Capital.

About the Role:

We are looking for an experienced and dedicated Site Reliability Engineer (SRE) with extensive knowledge of Microsoft Azure to join our expanding infrastructure team. The SRE will play a critical role in guaranteeing the reliability, scalability, and performance of our cloud-based systems and services. This position requires close collaboration with development, operations, and security teams to develop and sustain comprehensive infrastructure and automation solutions. Your role will be pivotal in maintaining our commitment to delivering dependable and scalable cloud solutions.

Responsibilities:

  • Design, implement, and maintain scalable, secure, and highly available infrastructure on Microsoft Azure.
  • Develop and manage Infrastructure as Code (IaC) using tools like Terraform, Ansible, or ARM templates.
  • Monitor system performance and availability using tools such as Datadog, Azure Monitor, Log Analytics, and Application Insights.
  • Automate operational tasks and incident response using PowerShell, Python, or Bash.
  • Collaborate with development teams to improve deployment pipelines and CI/CD processes using Azure DevOps, GitHub Actions, or similar tools.
  • Implement and enforce best practices for security, compliance, and cost optimization in Azure environments.
  • Participate in on-call rotations and incident response, conducting post-mortems and root cause analysis.
  • Drive continuous improvement in system reliability, observability, and operational efficiency.

Qualifications:

  • 3+ years in SRE, DevOps, Cloud Engineering, or similar roles.
  • Direct experience with Azure services (e.g., AKS, Functions, SQL, and App Services).
  • Proficient in Kubernetes, Docker, and container orchestration (e.g., AKS, EKS).
  • Proficiency in scripting and automation using Python, PowerShell, or Bash.
  • Experience with Infrastructure as Code (IaC) tools, particularly Terraform and Ansible.
  • Proficient in using Datadog for monitoring and alerting for Azure, Kubernetes, and CI/CD pipelines.
  • Solid understanding of SRE principles (SLO, SLI, error budgets).
  • Strong troubleshooting and critical thinking skills.
  • Effective communicator across technical and non-technical teams.
  • Experience with Windows Server and Linux administration.
  • Familiar with IIS, Apache, and Nginx.
  • Experience managing relational databases (PostgreSQL, Azure SQL, Cosmos DB).

Preferred Qualifications:

  • Azure certifications (e.g., Azure Solutions Architect)
  • Strong understanding of microservices-based architecture.
  • Bachelor’s degree in computer science or related field.

Why Conexiom?

  • Impactful Work: Play a pivotal role in transforming how businesses operate on a global scale.
  • Growth Opportunity: Be part of an ambitious company on a rapid growth trajectory, offering numerous opportunities for personal and professional development.
  • Remote Flexibility: Enjoy the flexibility of a fully remote position, allowing you to work from anywhere.
  • Inclusive Culture: Join a diverse team of innovative thinkers and doers, committed to fostering an inclusive environment where everyone can thrive.

Conexiom is proud to offer equal employment opportunities. If you have a disability or need that requires accommodation at any time during the recruitment process, please let us know.