Attiva gli avvisi di lavoro via e-mail!

Site Reliability Engineer

Crédit Agricole Italia

Reggio Emilia

In loco

EUR 40.000 - 70.000

Tempo pieno

19 giorni fa

Aumenta le tue possibilità di ottenere un colloquio

Crea un curriculum personalizzato per un lavoro specifico per avere più probabilità di riuscita.

Descrizione del lavoro

Una leading company nel settore bancario cerca un Site Reliability Engineer per il team di infrastruttura. Il candidato ideale avrà competenze in Terraform, Kubernetes e automazione, con responsabilità per la sicurezza e l'ottimizzazione delle prestazioni. Offriamo opportunità di crescita professionale e formazione continua.

Servizi

Autonomia e responsabilità
Crescita professionale
Formazione continua

Competenze

  • Ottima conoscenza di Terraform e Ansible.
  • Esperienza nella gestione di Kubernetes e tecnologie di containerizzazione.
  • Competenze di scripting in Bash, Python o Go.

Mansioni

  • Assicurare la disponibilità delle piattaforme attraverso monitoraggio e gestione degli incidenti.
  • Automatizzare il deployment e la gestione dei servizi.
  • Collaborare con team per pianificare le risorse future.

Conoscenze

Terraform
Ansible
Kubernetes
Scripting
Networking
Troubleshooting

Descrizione del lavoro

Following the creation of a new internal structure, we are looking for an experienced Site Reliability Engineer (SRE) to join our Infrastructure team.

Responsibilities:
  1. System Reliability: Ensuring the reliability and availability of our platforms and technological systems through robust monitoring, reporting, and incident response procedures.
  2. Infrastructure Automation: Automating the deployment, scaling, and management of services and infrastructure components for critical applications like digital channels and branches.
  3. Resource Planning: Collaborating with cross-functional teams to forecast and plan future resource requirements for all infrastructure systems.
  4. Performance Optimization: Analyzing platform performance to improve efficiency, ensuring an optimal experience for users and end customers.
  5. Incident Management Support: Participating in troubleshooting sessions, supporting operational and application teams, analyzing monitoring data and root causes, and proposing solutions.
  6. Security: Supporting implementation and maintaining security best practices, participating in vulnerability assessments and threat mitigation.
  7. Continuous Improvement: Improving system reliability through root cause analysis, incident reporting, and proactive maintenance and evolution of systems and platforms.
Required Experience:
  • Excellent knowledge of Terraform and Ansible
  • Understanding of containerization technologies (e.g., Docker, containerd)
  • Expertise in Kubernetes management and components (e.g., ingresses, monitoring stacks, custom autoscalers)
  • Strong troubleshooting skills
  • Understanding of delivery systems (e.g., Helm, GitOps)
  • Knowledge of at least one major cloud provider
  • Scripting and programming skills (e.g., Bash, Python, Go)
  • Understanding of networking
  • Experience with databases like Oracle DB, MongoDB, PostgreSQL
Nice to Have:
  • Experience with GCP, AWS, Azure
  • Experience with distributed systems such as caching systems (e.g., Redis), message brokers (e.g., RabbitMQ), log collection systems (e.g., ELK)
What We Offer:
  • Autonomy and responsibility: freedom to choose, try, fail, and learn
  • Career growth: evaluations every six months to guide your development
  • Continuous training: access to courses and industry expert learning opportunities

Location: Reggio Emilia, Italia

Ottieni la revisione del curriculum gratis e riservata.
oppure trascina qui un file PDF, DOC, DOCX, ODT o PAGES di non oltre 5 MB.