Enable job alerts via email!

Senior Site Reliability Engineer

Fusion Software Solutions

Hyderabad

Hybrid

INR 15,00,000 - 20,00,000

Full time

Today
Be an early applicant

Job summary

A leading technology company in Hyderabad is seeking a Senior Site Reliability Engineer to ensure the reliability and performance of production systems. The ideal candidate will have 8+ years of experience, particularly in supporting microservices and managing cloud platforms. Key responsibilities include incident management, scripting proficiency, and working with CI/CD pipelines. This role is full-time and offers a hybrid work model.

Responsibilities

  • Ensure reliability, scalability, and performance of production systems.
  • Support modern microservices architectures.
  • Manage cloud infrastructure and maintain high availability.

Skills

Site Reliability Engineering
Microservices architectures
Google Cloud Platform
Amazon Web Services
Datadog
GitHub Actions
Scripting skills (Python, Bash, Go)
Kubernetes
Incident management
Networking and load balancing

Tools

Docker
Terraform
Helm
Job description
Overview

Role : Senior Site Reliability Engineer / Application Support Engineer

Location : Hyderabad, India (Hybrid Knowledge City)

Experience : 8 - 10 years

Employment : Full - Time with Fusion Software Solutions Pvt, Ltd.

Job Description

We are seeking a Senior Site Reliability Engineer / Application Support Engineer to ensure reliability, scalability, and performance of production systems. This role involves supporting modern microservices architectures, managing cloud infrastructure, and working closely with development teams to maintain high availability and incident response readiness.

Key Responsibilities
  • 8+ years of experience in Site Reliability Engineering, DevOps, or production operations.
  • Strong experience supporting microservices architectures in production.
  • Expertise with cloud platforms both Google Cloud Platform (GCP) and Amazon Web Services (AWS).
  • Proficiency with Datadog (metrics, monitoring, logging, and APM).
  • Solid experience with GitHub (CI/CD pipelines, GitHub Actions, version control).
  • Strong scripting/programming skills (Python, Bash, Go, or similar).
  • Hands-on experience with Kubernetes, Docker, or other container orchestration tools.
  • Deep understanding of incident management, with experience in 24x7 support rotations.
  • Strong foundation in networking, load balancing, security, and system performance.
Nice to Have
  • Experience in the media or streaming industry.
  • Familiarity with Terraform, Helm, or other infrastructure automation tools.
  • Experience with incident management tools (PagerDuty, Opsgenie, etc.).
  • Knowledge of Agile/Scrum methodologies and collaboration tools (Jira, Confluence).
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.