Job Responsibilities:
- Deep dive into technical RCA of repeat incidents and identify & track action items.
- Work with Technical SME to drive Problem management agenda to avoid any repeat RCA in Application and Infrastructure domain.
- Track open RCAs and identify any RCA that requires increased focus to meet committed service levels (e.g., hosting Problem management meetings as required, one-on-one mentoring, escalation, etc.).
- Drive both PPM and CPM (proactive and corrective Problem Management) across all incidents.
- Publish all prescribed reports/dashboards as per defined timelines.
- Own and execute all Problem Management matrices like number of problem records, RCA submission within TAT, number of open Problem Records, quality of RCA, tracking action items, and ensuring timely closure.
- Act as a point of escalation for day-to-day Problem issues and escalate to Owner Groups as required to bring the resolution of Problems back on schedule.
- Manage team and collaborate with leadership team.
- Ensure the quality of the RCAs, as requested, and identify areas for improvement using key measurements.
- Ensure reduction in incidents (including repeat) by effective Problem Management.
- Deep dive into technical resolution of repeat incidents.
- Work with Technical SME to drive incident agenda to avoid any incident in Application and Infrastructure domain.
- Lead as a point of escalation for day-to-day incident issues and escalate to Incident Owner Groups as required to bring the resolution of incidents back on schedule.
- Track open incidents and identify any incident that requires increased focus to meet committed service levels (e.g., hosting incident meetings as required, one-on-one mentoring, escalation, etc.).
Education:
Graduate degree or equivalent qualification in Computer Science, IT, Electronics & Telecommunications.
Experience Requirements:
15-18 years of relevant industry experience.