Hello, I hope you are doing good. Position – Site Reliability Engineer
Location – France (REMOTE)
Language - English
Experience - 8 Years
Duration - 6 months Contract, It's extendable.
NOTE - Banking OR BFSI domain experience mandatory
Job Description
Primary Responsibilities:
- Develop software to make infrastructure services self-managing and self-service.
- Deliver continuous service improvement by developing Infrastructure as Code.
- Eliminate manual, repetitive, automatable, tactical tasks that are devoid of value.
- Improve system performance, make effective use of resources, distribute load, and reduce latency.
- Identify SLOs (Service Level Objectives) to meet availability and latency objectives.
- Develop pro-active monitoring solutions that alert on symptoms and not just on outages.
- Perform detailed root cause analysis (RCA) on incidents and outages to prevent future issues.
- Partner with development teams to improve services via rigorous testing and release procedures.
- Identify technical debt and partner with application teams to build remediation plans.
- Develop standard operational procedures and produce effective documentation.
- Analyse workloads and devise suitable cloud migration strategies where appropriate.
- Ensure all project/investment workloads are delivered according to plans and budget defined.
- Liaise with infrastructure Control and IT Risk teams to satisfy internal and external audit requests.
- Deputise for team lead when required and act accordingly.
- Identify cost-saving and optimization opportunities across the group.
- Build strong working relationships across the organization.
Essential Skills:
- Exceptional knowledge of PowerShell, including automation, API integration, and modularization.
- Exceptional skills in Microsoft Windows Server internals and related technologies.
- Excellent skills in managing and maintaining Active Directory, DHCP, DNS, LDAP, and Kerberos.
- Advanced knowledge of clustering, high-availability, replication, and disaster recovery techniques.
- Ability to tune network, storage, server, and virtualization layers for optimal performance and reliability.
- Ability to interpret and implement CIS security hardening recommendations in a controlled manner.
- Extensive experience in hardware performance monitoring and tuning complex low latency systems.
- Fluent in backup and recovery processes and procedures.
- Excellent performance tuning skills, with in-depth knowledge of system internals, performance counters, and analysis tools.
- Awareness of security and auditing requirements in a regulated environment.
Highly Desirable:
- Experience in writing and managing plays/playbooks on AWX / Ansible Tower.
- Networking protocols (TCP/IP, DNS, DHCP, VLANs).
Location: Paris, Ile-de-France, France