Job Description:
- Provide hands-on SRE technical support at the squad level, ensuring 24x7 coverage.
- Drive transformation by continuously seeking automation opportunities for existing processes.
- Track, audit, monitor, and implement tasks across technical work streams.
- Act as a portfolio SME: understand and document the core components, functionality, and infrastructure of supported applications.
- Serve as an escalation point during on-call rotations, covering maintenance, scheduled work, production support, and release deployments.
- Assist in incident and problem management, including root cause analysis (RCA) and ownership of RCA action items.
- Focus on continuous improvement and adherence to technical standards: enhance productivity, monitoring, tooling, and best practices.
- Manage technology updates such as server patching, certificate renewal, and compliance, with an emphasis on automation.
- Develop and implement top-tier technical solutions by monitoring industry-leading practices and adapting them to client needs.
- Leverage team and enterprise resources to develop better solutions and foster a cross-enterprise mindset.
- Contribute to shaping the overall SRE strategy and roadmap development.
Qualifications and Experience:
- 2-5 years of experience as an SRE.
- 4-5 years of related field experience.
- Bachelor’s degree in Computer Science, Mathematics, Engineering, Physics, or equivalent practical experience.
- Advanced knowledge of SRE practices and technologies such as Python, YAML, Shell scripting, Azure, Linux, Dynatrace, Prometheus, PagerDuty, Moog, Elastic, Azure Monitor, Chaos Engineering, MQ, Kafka.
- Experience in production support roles, including off-hours support.
- Ability to influence at senior and principal levels.
- Hands-on experience with SRE tools like Ansible, Azure Automation, Catchpoint.