Role Purpose
- The IT Infrastructure Supervisor is responsible for overseeing the daily operations, maintenance, and support of the organization's core IT infrastructure. This includes data centers, servers, storage, networks, and telecommunications systems.
- This is a hands‑on leadership role that combines technical expertise with team management. The supervisor leads a team of IT specialists, ensuring the stability, integrity, and security of all systems. The role is critical for minimizing downtime, supporting business operations, planning for future growth, and executing infrastructure‑related projects.
Key Responsibilities
Infrastructure Operations & Maintenance
System Management:
- Oversee the health and performance of virtualization platforms (e.g., VMware vSphere, Microsoft Hyper‑V).
- Manage and maintain core Microsoft services: Active Directory, DNS, DHCP, GPO, and File/Print services.
- Ensure the availability and performance of Azure IaaS and PaaS services (e.g., Virtual Machines, Azure SQL, Storage Accounts).
- Administer and support the Office 365 environment, including Exchange Online, SharePoint Online, Teams, and Entra ID (Azure AD).
Network Management:
- Ensure the stability and performance of all LAN/WAN/SD‑WAN switching and routing infrastructure.
- Manage branch network communications over SD‑WAN, drawing on hands‑on experience with SD‑WAN platforms such as Sophos and Fortinet.
- Oversee the management of firewalls, VPN tunnels, and remote access solutions.
- Troubleshoot network performance issues using monitoring and diagnostic techniques (e.g., packet capture, flow analysis).
Data Center Management:
- Coordinate physical access for staff and vendors, ensuring security protocols are followed.
Backup & Recovery:
- Supervise, configure, and manage the Veeam Backup & Replication solution for both on‑premises and cloud workloads.
- Verify the success of daily backup jobs and remediate any failures.
- Plan, conduct, and document regular disaster recovery (DR) tests and data restore verifications.
Monitoring:
- Implement and maintain infrastructure‑wide monitoring tools (e.g., Azure Monitor, SolarWinds, PRTG) to proactively detect issues.
- Configure and tune alerts to minimize false positives and ensure critical issues are flagged.
- Develop and maintain performance dashboards for key systems and services.
Security Operations:
- Work directly with the Information Security team to implement and enforce security policies, standards, and controls.
- Manage infrastructure components of Identity and Access Management (IAM) across on‑prem (Active Directory) and cloud (Entra ID).
- Participate in security incident response, providing technical support to isolate and remediate threats.
Patch Management:
- Oversee the end‑to‑end vulnerability and patch management process for servers and network devices.
- Schedule and deploy patches using tools like WSUS, MECM, or Azure Update Management.
- Test patches in a staging environment and report on patch compliance status.
Access Control:
- Enforce the principle of least privilege for all systems, applications, and data.
- Conduct regular reviews of administrator‑level access and user permissions.
- Manage and audit role‑based access control (RBAC) in Azure and Office 365.
Skills & Competencies
A. INFRASTRUCTURE MANAGEMENT
- Server Administration: Expertise in Windows Server, Linux, or Unix environments (installation, configuration, patching, and maintenance).
- Virtualization: Proficiency with VMware, Hyper‑V, or other hypervisors.
- Storage Management: SAN/NAS technologies, backup and recovery systems, replication, and disaster recovery planning.
- Network Fundamentals: Understanding of LAN/WAN, routing, switching, VLANs, firewalls, and load balancers.
- Cloud Platforms: Hands‑on experience with Azure, AWS, or Google Cloud for hybrid infrastructure deployment.
- Monitoring & Performance Management: Tools such as Nagios, SolarWinds, Zabbix, or Datadog to track uptime, resource utilization, and incidents.
- Identity & Access Management (IAM): Managing Active Directory, Entra ID (Azure AD), Single Sign‑On (SSO), and MFA solutions.
B. PLATFORM SERVICES
- Email & Collaboration Platforms: Microsoft 365, Exchange Online, SharePoint, Teams, or Google Workspace administration.
- Enterprise Systems Integration: Understanding APIs, middleware, and data exchange between enterprise systems.
- Automation & Scripting: PowerShell, Python, or Bash scripting for automating administrative tasks.
- Patch Management & Endpoint Security: WSUS, MECM (formerly SCCM), Intune, or similar tools for device compliance.
- Backup & Disaster Recovery (DR): Experience in designing and managing DR plans and conducting failover tests.
- ITSM Tools: Experience with ServiceNow, ManageEngine, or similar platforms for ticketing and change management.
Internal & External Interfaces
- Internal customers, vendors, and service providers.
Qualifications & Experience
- Bachelor's or Master's degree in Computer Science.
- Demonstrated experience with Microsoft 365, Azure, cloud platforms, systems infrastructure, and advanced networking.
- Microsoft and networking certifications.
- 10‑12 years of experience.