Enable job alerts via email!

Site Reliability Engineers

UPS Supply Chain Solutions (UPS SCS)

Chennai District

On-site

INR 9,00,000 - 15,00,000

Full time

Today

Be an early applicant

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A leading logistics company is seeking a Site Reliability Engineer to ensure the reliability of critical services within Google Cloud Platform and RedHat OpenShift. The ideal candidate will have over 4 years of experience in site reliability engineering and a strong proficiency in Google Cloud services. Responsibilities include designing cloud infrastructure, developing automation scripts, and collaborating with teams on system reliability. This is a permanent position located in Chennai, India.

Qualifications

4+ years of experience in site reliability engineering or a similar role.
Strong scripting skills in Python or Bash.
Experience with cloud infrastructure management.

Responsibilities

Ensure the reliability and uptime of critical services and infrastructure.
Design, implement, and manage cloud infrastructure using Google Cloud services.
Develop and maintain automation scripts and tools to improve system efficiency.

Skills

Google Cloud services expertise

Automation scripting

Monitoring tool experience

Collaboration skills

Education

Bachelor’s degree in Computer Science or Engineering

Tools

Terraform

Ansible

Prometheus

Grafana

Jenkins

GitLab CI

About The Role

Role Site Reliability Engineers (SREs) in Google Cloud Platform (GCP) and RedHat OpenShift administration.

Responsibilities

System Reliability: Ensure the reliability and uptime of critical services and infrastructure.
Google Cloud Expertise: Design, implement, and manage cloud infrastructure using Google Cloud services.
Automation: Develop and maintain automation scripts and tools to improve system efficiency and reduce manual intervention.
Monitoring and Incident Response: Implement monitoring solutions and respond to incidents to minimize downtime and ensure quick recovery.
Collaboration: Work closely with development and operations teams to improve system reliability and performance.
Capacity Planning: Conduct capacity planning and performance tuning to ensure systems can handle future growth.
Documentation: Create and maintain comprehensive documentation for system configurations, processes, and procedures.

Qualifications

Education: Bachelor’s degree in Computer Science, Engineering, or a related field.
Experience: 4+ years of experience in site reliability engineering or a similar role.
Proficiency in Google Cloud services (Compute Engine, Kubernetes Engine, Cloud Storage, BigQuery, Pub/Sub, etc.).
Familiarity with Google BI and AI/ML tools (Looker, BigQuery ML, Vertex AI, etc.).
Experience with automation tools (Terraform, Ansible, Puppet).
Familiarity with CI/CD pipelines and tools (Azure pipelines Jenkins, GitLab CI, etc.).
Strong scripting skills (Python, Bash, etc.).
Knowledge of networking concepts and protocols.
Experience with monitoring tools (Prometheus, Grafana, etc.).

Preferred Certifications

Google Cloud Professional DevOps Engineer
Google Cloud Professional Cloud Architect
Red Hat Certified Engineer (RHCE) or similar Linux certification

Employee Type

Permanent

Get your free, confidential resume review.

or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.