Enable job alerts via email!

Senior Site Reliability Engineer

Commify

Nottingham

Hybrid

GBP 65,000 - 75,000

Full time

16 days ago

Job summary

A global communications technology firm is seeking a Site Reliability Engineer to maintain system performance and enhance operational efficiencies. This role requires expertise in Microsoft Azure, Terraform, and Kubernetes, offering a competitive salary and flexible hybrid working. Join a vibrant team dedicated to shaping the digital future with innovative solutions.

Benefits

Competitive Salary (£65 - 75,000)
Flexible hybrid working
27 days annual leave plus national holidays
Mental Health Support through Calm
Unlimited professional & personal learning

Qualifications

  • Strong expertise in Terraform, App Services, and Kubernetes.
  • Experience in creating and modifying Terraform deployments.
  • Prior experience in an operations role, ideally as a Site Reliability Engineer.

Responsibilities

  • Maintaining high levels of system performance through monitoring and performance tuning.
  • Implementing scalability and fault tolerance.
  • Collaborating with engineering teams to support high-throughput production environments.

Skills

Microsoft Azure
Terraform
App Services
Kubernetes
Fluent in English
Reliability in systems
Automation scripting

Tools

Datadog
Azure Application Insights
Terraform deployments
Job description

At Commify, we're not just a company—we're a globally connected team of innovators who love what we do. As a CPaaS leader with 25 years of groundbreaking experience, we're the force behind over 5 billion customer interactions each year, enabling businesses worldwide to connect via advanced channels like SMS, RCS, and complex mobile journeys.

Our culture is our core strength. Operating across the UK, EMEA, the USA, and Australia, we've fostered a truly diverse and connected environment, earning a consistent 4 out of 5 culture score in our employee engagement surveys. You'll join a vibrant team where your diverse experience makes a daily global impact.

We need talented people to grow a global company where everyone feels proud to belong, have a purpose and do their best to directly shape the digital future.

About The Role:

We’re on the look out for a super talented Site Reliability Engineer to join our team, you will be an integral part of our SRE team. Your focus will be on ensuring that our products and platforms perform at their best, understanding how our software interacts with both physical and Cloud infrastructure to deliver exceptional messaging solutions.

Key Responsibilities:
  • Maintaining high levels of system performance through monitoring and performance tuning
  • Implementing scalability and fault tolerance
  • Automating processes and improving operational efficiencies
  • Troubleshooting application and middleware challenges
  • Collaborating with engineering teams to support high-throughput production environments
  • Building and maintaining robust deployment pipelines
What You’ll Bring:
  • Proficiency with Microsoft Azure
  • Strong expertise in Terraform, App Services, and Kubernetes
  • Fluent in both written and spoken English
  • A genuine passion for reliability in systems
  • Experience in creating and modifying Terraform deployments
  • Prior experience in an operations role, ideally as a Site Reliability Engineer
  • Ability to work cross-functionally, take ownership of tasks, and prioritize effectively
  • Excellent communication and collaboration skills
  • Experience with monitoring solutions (e.g., Datadog, Azure Application Insights, Log Analytics)
  • Programming/scripting skills for automation (favoring PowerShell, but also comfortable with Bash, C#, Ruby, or Python)
  • Experience with web-based applications

It's desirable for you to have:

  • Familiarity with Azure DevOps pipelines
  • Experience with Microsoft Server Operating Systems
  • Understanding of service level objectives and operational requirements for cloud-based solutions
  • Comprehensive knowledge of Microsoft Azure Cloud offerings (especially in PaaS)
  • Experience with tools such as Terraform, Ansible, VSTS, ARM, Puppet, Chef, Jenkins, ELK, and Grafana
  • Understanding of DNS, Load Balancer configuration, Active Directory, and network infrastructure in the cloud
  • Experience in agile environments and methodologies including TDD, Scrum, or Kanban
  • Knowledge of monitoring and alerting systems for microservice architectures
  • Applied knowledge of cloud security best practices
What We Offer:
  • Competitive Salary (£65 - 75,000)
  • Flexible hybrid working
  • 27 days annual leave plus national holidays.
  • Enhance family leave
  • Enjoy your Birthday off - because it's your day!
  • Mental Health Support through our Wellbeing partner, Calm
  • Wellbeing leave and a Mental Health First Aider program
  • Giving back days to help support causes close to your heart
  • Unlimited professional & personal learning
  • Total Rewards including retirement planning, healthcare and life assurance

We also have epic team socials, and we know how to celebrate in style!

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.