Enable job alerts via email!

Senior Site Reliability Engineer

Commify

Nottingham

Hybrid

GBP 65,000 - 75,000

Full time

16 days ago

Job summary

A global communications technology firm is seeking a Site Reliability Engineer to maintain system performance and enhance operational efficiencies. This role requires expertise in Microsoft Azure, Terraform, and Kubernetes, offering a competitive salary and flexible hybrid working. Join a vibrant team dedicated to shaping the digital future with innovative solutions.

Benefits

Competitive Salary (£65 - 75,000)

Flexible hybrid working

27 days annual leave plus national holidays

Mental Health Support through Calm

Unlimited professional & personal learning

Qualifications

Strong expertise in Terraform, App Services, and Kubernetes.
Experience in creating and modifying Terraform deployments.
Prior experience in an operations role, ideally as a Site Reliability Engineer.

Responsibilities

Maintaining high levels of system performance through monitoring and performance tuning.
Implementing scalability and fault tolerance.
Collaborating with engineering teams to support high-throughput production environments.

Skills

Microsoft Azure

Terraform

App Services

Kubernetes

Fluent in English

Reliability in systems

Automation scripting

Tools

Datadog

Azure Application Insights

Terraform deployments

At Commify, we're not just a company—we're a globally connected team of innovators who love what we do. As a CPaaS leader with 25 years of groundbreaking experience, we're the force behind over 5 billion customer interactions each year, enabling businesses worldwide to connect via advanced channels like SMS, RCS, and complex mobile journeys.

Our culture is our core strength. Operating across the UK, EMEA, the USA, and Australia, we've fostered a truly diverse and connected environment, earning a consistent 4 out of 5 culture score in our employee engagement surveys. You'll join a vibrant team where your diverse experience makes a daily global impact.

We need talented people to grow a global company where everyone feels proud to belong, have a purpose and do their best to directly shape the digital future.

About The Role:

We’re on the look out for a super talented Site Reliability Engineer to join our team, you will be an integral part of our SRE team. Your focus will be on ensuring that our products and platforms perform at their best, understanding how our software interacts with both physical and Cloud infrastructure to deliver exceptional messaging solutions.

Key Responsibilities:

Maintaining high levels of system performance through monitoring and performance tuning
Implementing scalability and fault tolerance
Automating processes and improving operational efficiencies
Troubleshooting application and middleware challenges
Collaborating with engineering teams to support high-throughput production environments
Building and maintaining robust deployment pipelines

What You’ll Bring:

Proficiency with Microsoft Azure
Strong expertise in Terraform, App Services, and Kubernetes
Fluent in both written and spoken English
A genuine passion for reliability in systems
Experience in creating and modifying Terraform deployments
Prior experience in an operations role, ideally as a Site Reliability Engineer
Ability to work cross-functionally, take ownership of tasks, and prioritize effectively
Excellent communication and collaboration skills
Experience with monitoring solutions (e.g., Datadog, Azure Application Insights, Log Analytics)
Programming/scripting skills for automation (favoring PowerShell, but also comfortable with Bash, C#, Ruby, or Python)
Experience with web-based applications

It's desirable for you to have:

Familiarity with Azure DevOps pipelines
Experience with Microsoft Server Operating Systems
Understanding of service level objectives and operational requirements for cloud-based solutions
Comprehensive knowledge of Microsoft Azure Cloud offerings (especially in PaaS)
Experience with tools such as Terraform, Ansible, VSTS, ARM, Puppet, Chef, Jenkins, ELK, and Grafana
Understanding of DNS, Load Balancer configuration, Active Directory, and network infrastructure in the cloud
Experience in agile environments and methodologies including TDD, Scrum, or Kanban
Knowledge of monitoring and alerting systems for microservice architectures
Applied knowledge of cloud security best practices

What We Offer:

Competitive Salary (£65 - 75,000)
Flexible hybrid working
27 days annual leave plus national holidays.
Enhance family leave
Enjoy your Birthday off - because it's your day!
Mental Health Support through our Wellbeing partner, Calm
Wellbeing leave and a Mental Health First Aider program
Giving back days to help support causes close to your heart
Unlimited professional & personal learning
Total Rewards including retirement planning, healthcare and life assurance

We also have epic team socials, and we know how to celebrate in style!

Get your free, confidential resume review.

or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.