Site Reliability Engineering Senior Manager

Sourceo Pte Ltd
Singapore
USD 80,000 - 120,000
Job description

Roles & Responsibilities

Qualifications

Requirements

  1. Degree in IT, Computer Science or related field
  2. Minimum 10 years of root cause analysis (RCA) exposure & involvement leading discussions as a problem manager or incident commander
  3. In depth understanding of Public/Private/Hybrid cloud solutions
  4. Hands on experience with popular CI/CD tools like Jenkins, Nexus, SonarQube, Bitbucket etc.
  5. Good exposure to logging & monitoring tools like Splunk, Dynatrace, Prometheus, Grafana, ELF/ELK
  6. Good understanding of cloud native technologies like Containers, Kubernetes etc.
  7. Develop & enhance production monitoring & management capabilities leveraging existing platforms & tools
  8. In depth understanding of Incident & Problem Management functions & activities
  9. Good understanding of Identity and access management
  10. Software incident & problem management
  11. Work with stakeholders & command centre in trouble shooting, escalating & solutioning critical site incidents
  12. Proficiency in event management tools and platforms
  13. Familiarity with ITIL (Information Technology Infrastructure Library) practices related to Incident Management, Problem Management, Change Management and Event management
  14. Experience with AI/ML technologies and their application in incident analysis
Get a free, confidential resume review.
Select file or drag and drop it
Avatar
Free online coaching
Improve your chances of getting that interview invitation!
Be the first to explore new Site Reliability Engineering Senior Manager jobs in Singapore