Job Search and Career Advice Platform

Enable job alerts via email!

Site Reliability Engineers

UPS Supply Chain Solutions (UPS SCS)

Chennai District

On-site

INR 9,00,000 - 15,00,000

Full time

Today
Be an early applicant

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A leading logistics company is seeking a Site Reliability Engineer to ensure the reliability of critical services within Google Cloud Platform and RedHat OpenShift. The ideal candidate will have over 4 years of experience in site reliability engineering and a strong proficiency in Google Cloud services. Responsibilities include designing cloud infrastructure, developing automation scripts, and collaborating with teams on system reliability. This is a permanent position located in Chennai, India.

Qualifications

  • 4+ years of experience in site reliability engineering or a similar role.
  • Strong scripting skills in Python or Bash.
  • Experience with cloud infrastructure management.

Responsibilities

  • Ensure the reliability and uptime of critical services and infrastructure.
  • Design, implement, and manage cloud infrastructure using Google Cloud services.
  • Develop and maintain automation scripts and tools to improve system efficiency.

Skills

Google Cloud services expertise
Automation scripting
Monitoring tool experience
Collaboration skills

Education

Bachelor’s degree in Computer Science or Engineering

Tools

Terraform
Ansible
Prometheus
Grafana
Jenkins
GitLab CI
Job description
About The Role

Role Site Reliability Engineers (SREs) in Google Cloud Platform (GCP) and RedHat OpenShift administration.

Responsibilities
  • System Reliability: Ensure the reliability and uptime of critical services and infrastructure.
  • Google Cloud Expertise: Design, implement, and manage cloud infrastructure using Google Cloud services.
  • Automation: Develop and maintain automation scripts and tools to improve system efficiency and reduce manual intervention.
  • Monitoring and Incident Response: Implement monitoring solutions and respond to incidents to minimize downtime and ensure quick recovery.
  • Collaboration: Work closely with development and operations teams to improve system reliability and performance.
  • Capacity Planning: Conduct capacity planning and performance tuning to ensure systems can handle future growth.
  • Documentation: Create and maintain comprehensive documentation for system configurations, processes, and procedures.
Qualifications
  • Education: Bachelor’s degree in Computer Science, Engineering, or a related field.
  • Experience: 4+ years of experience in site reliability engineering or a similar role.
  • Proficiency in Google Cloud services (Compute Engine, Kubernetes Engine, Cloud Storage, BigQuery, Pub/Sub, etc.).
  • Familiarity with Google BI and AI/ML tools (Looker, BigQuery ML, Vertex AI, etc.).
  • Experience with automation tools (Terraform, Ansible, Puppet).
  • Familiarity with CI/CD pipelines and tools (Azure pipelines Jenkins, GitLab CI, etc.).
  • Strong scripting skills (Python, Bash, etc.).
  • Knowledge of networking concepts and protocols.
  • Experience with monitoring tools (Prometheus, Grafana, etc.).
Preferred Certifications
  • Google Cloud Professional DevOps Engineer
  • Google Cloud Professional Cloud Architect
  • Red Hat Certified Engineer (RHCE) or similar Linux certification
Employee Type

Permanent

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.