Ops Lead Engineer – Big Data Platform

Lebara Media Services Private Ltd

Greater London

On-site

GBP 70,000 - 90,000

Full time

Today

Job summary

A leading technology firm in Greater London is seeking an experienced Operations Lead to manage the performance of their Azure-based data platform. The ideal candidate should have a strong background in operational leadership and incident management, along with hands-on experience with Azure tools. This role requires solving operational issues, leading stakeholder communication, and ensuring platform reliability and efficiency.

Qualifications

  • Proven track record in leading operations for large-scale data platforms.
  • Skilled in incident triage and defining/enforcing SLAs.
  • Hands-on experience with Azure tools and CI/CD processes.

Responsibilities

  • Own the day-to-day stability and performance of the Azure data platform.
  • Act as the primary point of contact for incidents and outages.
  • Define, implement, and enforce SLAs for critical datasets.

Skills

Operational Leadership
Incident & SLA Management
Azure Data Stack
Automation & CI/CD
FinOps Mindset
Monitoring & Observability

Tools

Azure Synapse
Databricks
ADF
Power BI

Job description

Role Summary:

We are looking for a strategic and hands-on Operations Lead to ensure the resilience, performance, and cost-effectiveness of our Azure-based data platform. This role is at the heart of our data ecosystem, combining platform reliability, incident response, SLA management, cost optimization (FinOps), and deployment oversight.

You will be the single point of contact for operational issues, driving rapid resolution during outages, leading communications with stakeholders, and shaping the processes that keep our platform running smoothly and efficiently.

Responsibilities:
  • Own the day-to-day stability and performance of our Azure data platform (Synapse, Databricks, ADF, Power BI).
  • Act as the primary point of contact for incidents and outages, driving resolution, root cause analysis, and clear stakeholder communication.
  • Define, implement, and enforce SLAs for critical pipelines, datasets, and reporting assets.
  • Run FinOps forums with business stakeholders to improve cost transparency, accountability, and efficiency across the platform.
  • Oversee CI/CD pipelines and deployments, ensuring reliable, safe, and compliant delivery of data platform changes.
  • Champion monitoring, observability, and automation to detect and resolve issues proactively while reducing manual intervention.
  • Develop and maintain operational runbooks, escalation protocols, and incident playbooks to strengthen resilience.
  • Partner with data engineering and analytics teams to align operational strategy with business goals and the future platform roadmap.

Skills Required:
  • Operational Leadership: Proven track record in leading operations for large-scale data platforms, ensuring stability, performance, and stakeholder trust.
  • Incident & SLA Management: Skilled in incident triage, root cause analysis, escalation handling, and defining/enforcing SLAs with cross-functional teams.
  • Azure Data Stack: Hands-on experience with Azure Synapse, Databricks, ADF, and Power BI, with the ability to guide best practices and optimisations.
  • Automation & CI/CD: Familiar with CI/CD processes and automation to streamline deployments and reduce manual intervention.
  • FinOps Mindset: Experience in cost management, usage reporting, and running forums with business stakeholders to drive accountability and efficiency.
  • Monitoring & Observability: Knowledge of modern monitoring, alerting, and data quality frameworks to ensure proactive platform health management.