Overview
Production Environment Management
- Maintain and monitor AWS and Teraco environments to ensure 99.9% uptime.
- Implement and manage Infrastructure as Code (IaC) using Terraform.
- Optimize the scalability, security, and performance of AWS services (EC2, RDS, ECS, Lambda, etc.).
- Utilize monitoring tools like AWS CloudWatch, Datadog, or Grafana for system visibility and alerting.
Patching and Upgrading
- Perform regular patching and upgrades based on the approved patch policy.
- Plan and execute maintenance windows with minimal service disruption.
- Address vulnerabilities using scan results (e.g. Rapid7) to ensure environment compliance.
CI/CD Pipeline Management
- Deploy approved GitLab pipelines into production based on JSM change requests.
- Collaborate with Development and QA teams for seamless code deployment.
- Optimize pipelines for performance, security, and reliability.
Incident Response and Monitoring
- Proactively detect and respond to system issues.
- Troubleshoot incidents to reduce downtime and recovery time.
- Participate in the on-call rotation and perform Root Cause Analysis (RCA) for production incidents.
Change Management
- Review and implement approved changes in collaboration with the change management team.
- Maintain clear documentation and communication for all production changes.
Automation and Optimization
- Automate repetitive DevOps tasks (e.g. deployments, patching, monitoring).
- Optimize AWS resource usage to maintain cost-efficiency without compromising performance.
Collaboration and Documentation
- Work cross-functionally with Product Owners, Developers, QA, Security, and Auditors.
- Maintain comprehensive documentation of systems, changes, and incident resolutions.
Requirements
Education & Certifications
Bachelor's Degree in Computer Science, Information Technology, or a related field, or equivalent work experience.
One or more of the following certifications (current and maintained):
- AWS Certified DevOps Engineer
- AWS Certified Solutions Architect Associate
- AWS Certified SysOps Administrator Associate
- Microsoft Certified: Azure Administrator Associate
- LPI Linux Essentials
- Red Hat Certified System Administrator
Experience
- Minimum of 3 years in a DevOps or Site Reliability Engineering (SRE) role.
- Hands-on experience in managing production workloads in AWS.
- Proven expertise in Linux server administration.
- Experience with GitLab CI/CD or similar tools.
Employment Type: Full Time
Experience: years
Vacancy: 1