Overview
- Provide L2 operational support for Windows Server (2016/2019/2022/2025) in on-premises and multi-cloud environments.
- Support cloud operations predominantly on Amazon Web Services, Microsoft Azure, and Google Cloud Platform.
- Work hands‑on with cloud services, including AWS (EC2, S3, IAM, EBS, CloudWatch, Systems Manager (SSM), AWS Backup, Security Groups, VPC), Azure (Azure Virtual Machines, Azure Storage, Azure Monitor, Azure Automation, Azure Backup), and GCP (Compute Engine, Cloud Storage, Cloud Monitoring, Cloud IAM).
- Manage and support Active Directory, DNS, DHCP, Group Policy Objects (GPO), WSUS, and Windows clustering.
- Monitor and maintain Windows workload performance and availability, and ensure the cloud security baseline is met across cloud platforms.
- Participate in 24/7 shift rotation to provide round‑the‑clock operational support.
Operating System Patch Management
- Perform comprehensive OS patching for Windows Server environments using WSUS, SCCM, AWS Systems Manager, and Azure Update Management.
- Execute monthly and quarterly patch cycles with coordination and approval workflows.
- Apply basic knowledge of Linux OS patching using YUM/DNF and cloud‑native patch management tools.
- Apply deep expertise in Wintel operating system patching, including pre‑patch validation, deployment, and post‑patch verification.
- Track patch compliance and generate reports for audit and compliance purposes.
- Coordinate patch windows and communicate with stakeholders.
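As an illustration of the patch compliance tracking described above, the following is a minimal sketch in Python. The `ServerPatchStatus` record, the `compliance_report` function, and the 95% threshold are hypothetical and chosen only for illustration; in practice the counts would come from WSUS, SCCM, or a cloud patch management tool.

```python
from dataclasses import dataclass


@dataclass
class ServerPatchStatus:
    """Hypothetical per-server record for one patch cycle."""
    name: str
    required: int   # patches approved for this cycle
    installed: int  # how many of those are installed


def compliance_report(servers, threshold=95.0):
    """Return (overall_percent, names_of_servers_below_threshold)."""
    total_required = sum(s.required for s in servers)
    total_installed = sum(s.installed for s in servers)
    # Treat a fleet with nothing to install as fully compliant.
    if total_required == 0:
        overall = 100.0
    else:
        overall = 100.0 * total_installed / total_required
    non_compliant = [
        s.name
        for s in servers
        if s.required and 100.0 * s.installed / s.required < threshold
    ]
    return round(overall, 1), non_compliant


if __name__ == "__main__":
    fleet = [
        ServerPatchStatus("app01", required=20, installed=20),
        ServerPatchStatus("db01", required=20, installed=15),
    ]
    print(compliance_report(fleet))  # (87.5, ['db01'])
```

A report of this shape gives auditors one overall figure and gives the operations team a list of servers to follow up on before the next patch window.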
Application Deployment & Troubleshooting
- Deploy and configure applications on Windows Server operating systems.
- Troubleshoot application issues at the OS level, including permissions, services, registry, and performance.
- Support application teams with OS‑level diagnostics and resolution.
- Perform application log analysis and performance tuning.
- Collaborate with development teams to resolve infrastructure‑related application problems.
Incident & Service Request Management
- Resolve incidents and service requests related to Wintel systems via ITSM platforms (ServiceNow, Jira, etc.).
- Follow ITIL processes for Incident, Problem, Change, and Request Management.
- Create and update tickets with detailed documentation and resolution steps.
- Escalate complex issues to Level 3 engineers and track resolution progress.
- Participate in Change Advisory Board (CAB) reviews and change implementations.
- Maintain SLAs and ensure timely ticket resolution.
Security & Compliance
- Execute CIS (Center for Internet Security) security remediations and hardening baselines.
- Implement and review IAM permissions using IAM Access Analyzer and the least‑privilege model.
- Perform Vulnerability Management System (VMS) remediation based on scan findings.
- Execute Cloudscape recommendations in collaboration with InfoSec teams.
- Work with security threat detection tools and remediate their findings.
- Support security compliance scanning and remediation activities.
- Maintain security configurations and monitor for security alerts.
- Implement and maintain SSL certificate management and renewal processes.
Containers & DevSecOps
- Demonstrate basic knowledge of container technologies (Docker, Kubernetes, ECS, EKS, AKS, GKE).
- Maintain familiarity with DevSecOps practices and tools used in the Singapore Government technology stack (SHIP-HATS).
- Support containerized Windows applications where applicable.
- Understand CI/CD pipeline concepts and security integration.
Automation & Scripting
- Develop and maintain PowerShell scripts for routine tasks, automation, and remediation.
- Utilize AWS CLI, Azure CLI, and gcloud CLI for cloud operations.
- Create and execute SSM Documents for automated remediation and configuration management.
- Automate repetitive operational tasks to improve efficiency.
Backup & Disaster Recovery
- Implement and maintain backup and recovery strategies for Windows servers in cloud environments.
- Perform backup validations and participate in disaster recovery testing.
- Support business continuity planning activities.
- Document and test recovery procedures.
Documentation & Knowledge Management
- Create and maintain technical documentation, knowledge articles, and standard operating procedures (SOPs).
- Document troubleshooting steps, configurations, and remediation procedures.
- Maintain runbooks for common operational tasks.
- Contribute to team knowledge base and continuous improvement initiatives.
Monitoring & Observability
- Configure and maintain monitoring using CloudWatch, Azure Monitor, and GCP Cloud Monitoring.
- Set up alerts, alarms, and notifications for critical systems.
- Analyze logs and metrics to identify and resolve issues proactively.
- Support integration with centralized monitoring and observability platforms.
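To make the alerting duties above concrete, the following is a simplified sketch of a threshold alarm that fires only after several consecutive breaching datapoints. The function name `alarm_state`, the state strings, and the default of three periods are assumptions for illustration; this is not the actual evaluation logic of CloudWatch, Azure Monitor, or Cloud Monitoring.

```python
def alarm_state(datapoints, threshold, periods=3):
    """Classify a metric series using a consecutive-breach rule.

    Returns "INSUFFICIENT_DATA" when fewer than `periods` datapoints exist,
    "ALARM" when the most recent `periods` datapoints all exceed `threshold`,
    and "OK" otherwise.
    """
    if len(datapoints) < periods:
        return "INSUFFICIENT_DATA"
    recent = datapoints[-periods:]
    return "ALARM" if all(value > threshold for value in recent) else "OK"


if __name__ == "__main__":
    cpu_percent = [50, 91, 95, 97]  # hypothetical CPU utilisation samples
    print(alarm_state(cpu_percent, threshold=90))  # ALARM
```

Requiring consecutive breaches is one common way to reduce noise from short spikes; the right number of periods depends on how critical the system is.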
Audit & Compliance Support
- Participate in internal and external audits.
- Provide evidence and documentation for compliance requirements.
- Support audit remediation activities.
- Maintain compliance with government security frameworks and standards.