Enable job alerts via email!

Senior Site Reliability Engineer

Cvent

Fredericton

On-site

CAD 90,000 - 120,000

Full time

2 days ago
Be an early applicant

Job summary

A leading technology firm is seeking a Senior Site Reliability Engineer in Fredericton, Canada. The role involves running Infrastructure as Code applications, optimizing pipeline processes, and guiding development teams to improve application reliability. Ideal candidates have strong experience with scripting, AWS management, and configuration tools, along with good communication skills. We focus on automation and AI-driven efficiency in our operations.

Qualifications

  • Experience with Agile SDLC methodologies and championing CI/CD.
  • Experience managing AWS services / operational knowledge of managing applications in AWS.
  • Hands-on experience with Windows and Linux/Unix Administration.
  • Good communication skills.
  • Identifying and prioritizing issues and finding solutions.

Responsibilities

  • Run Infrastructure as Code applications and build pipelines.
  • Guide development teams through infrastructure decisions.
  • Conduct incident retrospectives and implement SLI/SLOs.
  • Develop build, test and deployment automation.
  • Focus on automation of tasks and AI-driven operational efficiency.

Skills

Agile SDLC methodologies
Scripting languages (Ruby, Groovy, Bash, PowerShell, Typescript, Python)
AWS services management
Configuration management tools (Chef, Puppet, Ansible)
Windows and Linux/Unix Administration
APM, monitoring, logging tools
Containerization concepts (Docker, ECS, EKS, Kubernetes)
NoSQL databases (MongoDB, Couchbase, Postgres)
GitHub Copilot or other AI tools

Tools

Jenkins

Job description

Overview

Cvent is a global meeting, event, travel, and hospitality technology leader, with more than 5000+ employees worldwide. As a leading cloud-based technology company, we have over 28,000+ customers, including 80% of the Fortune 100 companies, in more than 100 countries.

Cvent’s software solutions optimize the entire event management value chain and have enabled clients around the world to manage hundreds of thousands of meetings and events. In addition to helping event planners navigate every aspect of the event process, we also provide an integrated platform to hoteliers to help create qualified demand for their hotels, manage that demand more efficiently, and measure their business performance in real-time.

In This Role, You Will

As a Senior Site Reliability Engineer, you will run Infrastructure as Code applications, build pipelines, and enable development teams. You will guide development teams through infrastructure decisions, conduct incident retrospectives, and implement and maintain SLI/SLOs. You will also help teams evaluate their reliability posture and prioritize work to solve reliability issues. As a Cvent SRE you will be a force for positive change and drive continuous improvement.

Additionally, You Will

  • Enlighten, Enable and Empower a fast-growing set of multi-disciplinary teams, across multiple applications and locations.
  • Tackle complex development, automation and business process problems.
  • Champion Cvent standards and best practices.
  • Ensure the scalability, performance, and resilience of our suite of products.
  • Work with the development and product team of a new application to establish the right monitoring and alerting strategy.
  • Develop build, test and deployment automation that seamlessly targets multiple on-premises and AWS regions.
  • Help a dev team working on a legacy code base to realize zero-down-time deployments.
  • Focus on automation of tasks and AI driven operational efficiency

Here's What You Need

  • Experience with Agile SDLC methodologies and championing CI/CD.
  • Scripting languages like Ruby, Groovy, Bash, PowerShell, Typescript or Python.
  • Exposure to managing AWS services / operational knowledge of managing applications in AWS
  • Experience with configuration management tools such as Chef, Puppet, Ansible or equivalent
  • Hands-on experience with Windows and Linux/Unix Administration
  • Working with APM, monitoring, and logging tools (Datadog, New Relic, Splunk)
  • Good understanding of containerization concepts - docker, ECS, EKS, Kubernetes
  • Experience managing 3 tier application stacks
  • Experience with build tools such as Jenkins
  • Working experience with NoSQL databases such as MongoDB, couchbase, postgres etc
  • F5 load balancing concepts
  • Understanding of basic networking concepts
  • Experience with package managers such as nexus, artifactory or equivalent
  • Experience with GitHub Copilot or other AI tools/agents
  • Good communication skills
  • Mentoring and supporting junior staff
  • Identifying and prioritizing issues, finding solutions to common problems by having a holistic view of Cvent systems.
  • Observability
  • Understanding of SRE concepts and the DevOps culture, with a focus on leveraging software engineering tools, methodologies, and concepts
  • Intermediate troubleshooting skills or leading incident response efforts.
  • Staying current with AI advancements through self-driven learning is preferred

Physical Demands

We are not able to offer sponsorship for this position
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.