Your role:
Global Relay delivers enterprise services to 23,000 customers in 90 countries, including 22 of the top 25 global banks. Our infrastructure teams provide fantastic opportunities for System Administrators who are passionate about maintaining reliable Linux systems and have a strong interest in automation, performance, and security. We are looking for someone who is motivated to automate on-prem-only solutions, including Linux-based OS updates, configuration management, and custom implementations created by our talented developers.
- Deploy, update, and monitor Linux systems
- Collaborate with development teams to reduce toil related to bespoke application deployments and associated tooling
- Support multiple on-premises environments
- Collaborate with application support teams to share and document knowledge and operational runbooks
- Scale and manage infrastructure with high throughput, availability, and storage requirements
- Assist with urgent issues, including participating in an on-call rota to provide second-level troubleshooting support when required
- Automate all the things
About you:
We are looking for someone who is willing to grow in the role and extend their skill set as our Information Archive solution continues to evolve to meet our future architectural goals and exceed customer expectations.
- Minimum 3 years of experience as a Linux system administrator of bare metal, VM, and orchestrated deployments
- Enjoys automating away manual tasks using scripting (Bash, Python, etc.) and configuration management tools such as Ansible
- A desire to dig deep to troubleshoot, debug, and decouple the layers that comprise automated deployments and configuration management
- Ability to analyze complex systems and problems and express them in simple terms
- Experience in troubleshooting networking issues
- Experience working in Agile teams
- A problem solver who takes initiative and is proactive
- Self-motivated while working on team-based projects
- A well-organized, thorough, and detail-oriented person
- Able to keep the 'bigger picture' in mind while prioritizing conflicting demands and tasks
- Confident enough to voice your opinion, ask questions, and suggest a better solution without being abrasive
- Enjoys collaborating with others, including other functional teams, to implement automated, scalable, stable, and efficient infrastructure
Nice to haves:
- Scripting (Bash, Python, etc.)
- Automation and Configuration Management tools (Ansible, Puppet, Chef, Salt, Fabric, etc.)
- Experience researching and advising on new technology implementations
- SQL administration (Postgres, MySQL, SQL Server, Oracle, etc.)
- Hands-on Kubernetes experience
- Containerization experience (Docker, Podman, etc.)
- Networking experience (DNS, VLANs, etc.)
- Experience with web servers and load balancers (Apache, Nginx, HAProxy, etc.)
- Monitoring and logging infrastructure (Zabbix, Splunk, ELK, Graphite, Grafana, Prometheus, etc.)
- Experience with CI/CD practices
- Proficiency in Jenkins and Jenkins Pipelines written in Groovy, running on Linux
- Experience with object-oriented languages (Java, C#, etc.)
- Experience with performance tuning, troubleshooting, and capacity planning for database and storage clusters (e.g., Cassandra, Ceph)