Enable job alerts via email!

Site Reliability Engineer II

Cvent

New Brunswick

On-site

CAD 80,000 - 100,000

Full time

Today
Be an early applicant

Job summary

A leading technology company in Canada, New Brunswick, is seeking a Site Reliability Engineer 2 to manage infrastructure applications and support multi-disciplinary teams. The ideal candidate has experience with AWS, scripting languages, and a solid understanding of DevOps practices. This full-time role offers opportunities for continuous improvement and collaboration across various applications.

Qualifications

  • Experience with Agile methodologies and CI/CD.
  • Proficient in scripting languages like Ruby or Python.
  • Experience in AWS services and applications.
  • Knowledge of configuration management tools.

Responsibilities

  • Manage infrastructure as code applications.
  • Build pipelines and enable development teams.
  • Support incident retrospectives and implement SLIs/SLOs.
  • Develop automation processes for deployment.

Skills

Agile SDLC methodologies
Scripting languages (Ruby, Groovy, etc.)
AWS services management
Configuration management tools (Chef, Puppet, etc.)
Windows and Linux/Unix Administration
APM and monitoring tools (Datadog, etc.)
Containerization (Docker, Kubernetes)
Jenkins build tools
NoSQL databases (MongoDB, etc.)
Networking knowledge
GitHub Copilot
Problem-solving skills
Observability principles
Incident response
DevOps culture
Job description
Overview

Cvent is a global meeting, event, travel, and hospitality technology leader, with more than 5000+ employees worldwide. As a leading cloud-based technology company, we have over 28,000+ customers, including 80% of the Fortune 100 companies, in more than 100 countries.

Cvent’s software solutions optimize the entire event management value chain and have enabled clients around the world to manage hundreds of thousands of meetings and events. In addition to helping event planners navigate every aspect of the event process, we also provide an integrated platform to hoteliers to help create qualified demand for their hotels, manage that demand more efficiently, and measure their business performance in real-time.

In This Role, You Will

As a Site Reliability Engineer 2, you will play a critical role in managing Infrastructure as Code applications, building pipelines, and enabling development teams. You will work alongside development teams to guide infrastructure decisions, assist with incident retrospectives, and help implement and maintain SLI / SLOs. Additionally, you will support teams in evaluating their reliability posture and prioritize efforts to resolve reliability issues. As an SRE 2 at Cvent, you will contribute to fostering positive change and driving continuous improvement.

Additionally, You Will
  • Support and empower a growing set of multi-disciplinary teams across various applications and locations.
  • Address complex development, automation, and business process challenges.
  • Promote Cvent's standards and best practices.
  • Ensure the scalability, performance, and resilience of our product suite.
  • Collaborate with development and product teams on new applications to establish effective monitoring and alerting strategies.
  • Develop automation for build, test, and deployment processes targeting multiple on-premises and AWS regions.
  • Assist development teams working on legacy code bases to achieve zero-downtime deployments.
  • Focus on task automation and AI-driven operational efficiency.
Here's What You Need

We believe that passion and willingness to learn outweigh any list of skills; however, having experience in some of the areas below would help you hit the ground running and succeed as an SRE at Cvent.

  • Familiarity with Agile SDLC methodologies and experience in championing CI / CD.
  • Proficiency in scripting languages such as Ruby, Groovy, Bash, PowerShell, Typescript, or Python.
  • Experience managing AWS services and operational knowledge of applications in AWS.
  • Experience with configuration management tools like Chef, Puppet, Ansible, or equivalent.
  • Exposure to Windows and Linux / Unix Administration.
  • Experience with APM, monitoring, and logging tools (e.g., Datadog, New Relic, Splunk).
  • Understanding of containerization concepts, including Docker, ECS, EKS, and Kubernetes.
  • Experience managing three-tier application stacks.
  • Experience with build tools such as Jenkins.
  • Familiarity with NoSQL databases like MongoDB, Couchbase, Postgres, etc.
  • Understanding of F5 load balancing concepts and basic networking knowledge.
  • Experience with package managers like Nexus, Artifactory, or equivalent.
  • Familiarity with GitHub Copilot or other AI tools / agents.
  • Strong communication skills.
  • Ability to identify and prioritize issues and solve problems with a holistic view of Cvent systems.
  • Understanding of observability principles.
  • Intermediate troubleshooting skills and experience in leading incident response efforts.
  • Knowledge of SRE concepts and DevOps culture, focusing on leveraging software engineering tools and practices.
Seniority level
  • Not Applicable
Employment type
  • Full-time
Job function
  • Engineering and Information Technology
Industries
  • Software Development

J-18808-Ljbffr

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.