Key Responsibilities
Platform & Systems Engineering
- Administer and maintain heterogeneous OS environments including Solaris and Linux across production and non-production zones.
- Plan and execute upgrades, patches, and platform enhancements across infrastructure components.
- Enforce security baselines, OS-level hardening, and deploy vendor/OS-specific vulnerability fixes.
- Track end-of-life status of platform components and proactively manage refresh cycles.
- Oversee renewal schedules for expiring credentials, passwords, and server-side certificates.
- Implement and maintain backup policies, system snapshots, and database archival processes to support data protection and recovery.
- Design and test DR strategies for server infrastructure; conduct routine failover and recovery simulations to validate readiness.
- Coordinate UAT activities for application teams and maintain integrity of the UAT environments.
- Maintain infrastructure documentation including architecture diagrams, admin manuals, and recovery procedures.
- Provide deep-dive (L3) technical troubleshooting support for complex platform issues and escalations.
Incident & Root Cause Handling
- Deliver Tier 2 response and resolution for OS-related issues within agreed SLA timelines.
- Perform in-depth analysis of recurring faults and design countermeasures to improve resilience.
Change & Release Operations
- Manage deployment windows for patching and upgrades under the established change governance framework.
- Build and package deployment artefacts using internal automation tools or scripts.
Asset & Configuration Governance
- Maintain a centralised repository of infrastructure assets and configuration profiles for servers and middleware.
- Ensure audit readiness of system configurations and change logs.
Capacity & Performance Tuning
- Track server utilisation metrics and forecast capacity needs in line with expected load trends.
- Conduct performance diagnostics and fine-tune system parameters to meet platform SLAs.
Process Development & Compliance
- Create and refine IT operations workflows aligned to corporate governance, audit, and security standards.
Project Support & Delivery
- Contribute to project rollouts by provisioning, configuring, and validating system infrastructure.
- Lead infrastructure enhancement workstreams in accordance with organisational best practices.
Knowledge Management
- Maintain system documentation, runbooks, operational standards, and configuration histories for continuity.
Requirements
- A Bachelor's degree in Computer Science, Information Technology, or related discipline.
- At least 8 years of experience in Unix/Linux system administration, preferably in high-security or regulated environments.
- Practical expertise in managing SAN-connected systems and integrating storage into Unix/Linux workflows.
- Solid experience with OS patching, vulnerability mitigation, and compliance alignment.
- Familiarity with enterprise-class backup, monitoring, and system health solutions.
- Comfortable with shell scripting and automation in Unix/Linux environments.
- Understanding of database support and data center operating procedures is advantageous.
- Willingness to participate in a 24/7 rotational support roster and respond to after-hours change events.
- Proven ability to communicate complex issues clearly across business and technical teams.
- Team player with a hands-on mindset, high ownership, and a problem-solving attitude.
- Able to handle multiple priorities and operate effectively under pressure.
- Certifications in Unix/Linux system administration (e.g., RHCSA, RHCE, Solaris Admin) required.
- Cloud administration certifications (AWS, Azure, or similar) are highly desirable.
Reg. No. R1878306
EA License no.: 16S8066