Attiva gli avvisi di lavoro via e-mail!

Site Reliability Engineer

Experteer Italy

Pisa

Ibrido

EUR 40.000 - 80.000

Tempo pieno

7 giorni fa
Candidati tra i primi

Aumenta le tue possibilità di ottenere un colloquio

Crea un curriculum personalizzato per un lavoro specifico per avere più probabilità di riuscita.

Descrizione del lavoro

An established industry player is seeking a Cloud Engineer with strong expertise in Site Reliability Engineering. This role involves maintaining internal tooling, improving service reliability, and promoting cloud-native practices. You will work with cutting-edge technologies in a dynamic environment, ensuring high performance and security for cloud applications. The ideal candidate is a proactive self-starter with a passion for automation and collaboration. Join a forward-thinking team that values innovation and offers a hybrid work culture, allowing flexibility while fostering in-person collaboration. Embrace the opportunity to make a significant impact in a thriving organization.

Competenze

  • Proven experience with cloud providers and cloud-native applications.
  • Strong coding skills in Python or Go and automation experience.

Mansioni

  • Maintain internal tooling to improve service reliability and scalability.
  • Promote SRE principles and help define SLIs and SLOs.

Conoscenze

Cloud Computing
Site Reliability Engineering (SRE)
Python
Go
Microservices Architecture
Network Topologies
Cyber Security Awareness
Distributed Systems
Automation
Communication Skills

Formazione

Bachelor's Degree in Computer Science or related field

Strumenti

AWS
Azure
Kubernetes
Containerization

Descrizione del lavoro

ION Core Banking – Cloud /

Full-time /

On-site

About us:

The ION Group is made up of innovators who provide trading and workflow automation solutions, high-value analytics, and strategic consulting to corporations, financial institutions, central banks, and governments.

More than 40% of the world’s largest companies use our solutions. We’ve achieved tremendous growth by bringing together some of the best and most successful financial technology companies in the world.

At ION, we offer careers that provide many opportunities: To invent. To design. To collaborate. To build. To transform businesses and empower people around the world to do more, faster and better than before. Imagine what you can do and experience. This is where you can do your best work.

We are looking for experienced people who are competent in the cloud and knowledgeable about the SRE (site reliability engineering) domain.

The team

The Core Architecture Team (CAT) produces and manages the core technology, methodologies, and frameworks that underpin all new or re-engineered ION products.

We provide our internal and external customers foundations and an open platform they can extend and evolve to manage their solutions independently and with reduced cost of ownership.

The ION Cloud Center of Excellence aims to support the Group's strategy toward a cloud-native offering via a cross–functional team of empowered people responsible for developing and managing the strategy, governance, and best practices for the entire Group.

Some of the team deliverables:

  1. Create the ION Cloud Infrastructure reusable by all the ION Divisions
  2. Reduce the total cost of ownership
  3. Provide guidelines and best practices for the entire organization
  4. Reduce operational complexity via automated platform configuration and deployment
  5. Provide tools that ease the developers to set up the CI environment for ION Products
  6. Governance on the development tools to increase operational efficiency
  7. Technology recommendations standardization and infrastructure and product design across the Group

Who you are

Your background is either in software development or operations/infrastructure (or both!), and you enjoy coding or automating your workflows.

You have proven experience in working with cloud providers and dealing with cloud-first applications engineered with a cloud-native mindset.

You are a self-starter individual and a constantly learning engineer who enjoys working in a team of peers.

You are open and candid about discussing solutions, problems, and improvements within your team and others in the engineering organization.

You have a passion for site reliability engineering (SRE) principles and adoption, and you are keen to start conversations with teams about the reliability, performance, and security of the applications, services, and systems.

You are an advocate of the DevOps or SRE approach, promoting loosely coupled, heavily automated, constantly monitored distributed systems, and you always plan for failure and never take anything for granted.

You are keen to raise the bar of the solutions provided by the whole engineering team (dev and ops).

You possess strong written and verbal communication skills.

You are happy to be involved in an on-call rotation when needed.

What you'll be doing

It’s fine to have some of these, the more the merrier!

The Cloud Engineer side

  1. Maintain our internal tooling and automation to improve the reliability, scalability, and observability of our services.
  2. Proactively identify and solve issues across the whole stack, together with the rest of the infrastructure and engineering teams.
  3. Contribute to raise awareness in the security and protection of the cloud, understanding how to fit these into timelines and backlog of the end team.
  4. Understand how a distributed application works, constraints, and limitations.
  5. Have strong coding and scripting experience and you are interested in improving your programming/coding knowledge (Python or Go ideally).

The Site Reliability Engineer side

  1. Promote and execute the adoption of SRE principles and raise awareness of the importance of reliability and automation.
  2. Help the team understand concepts like ownership, error budgets, and production readiness.
  3. Help define and implement SLIs, SLOs, and check SLAs to meet customer satisfaction.
  4. Work together with teams to identify and solve issues in platforms and tune services for reliability and performance.
  5. Aim to reduce toil and manual efforts with automation and repeatable and documented tooling and standard procedures.
  6. Take an active part in the incident management process to troubleshoot impacting issues in a timely manner and engage with all stakeholders involved.

Your skills, experience, and qualifications

These are must-haves!

  1. Our work language is English, hence it’s very important to be proficient with it.
  2. Extensive knowledge and experience in one of the major clouds, including AWS, Azure, GCP; with a comprehensive understanding and real-world implementation experience (We currently use AWS and Azure).
  3. Microservices in a cloud-native world: architecture, deployments, and engineering in the Kubernetes and Container space. You are familiar with how to protect services and adhere to industry standards/best practices.
  4. Understanding of network topologies, deployment methods, and constraints in the cloud.
  5. Familiarity with application development methodologies in a cloud-native environment and container-based runtime.
  6. Understanding of distributed systems is essential. You would benefit from having architectural concepts like SOA, object-oriented analysis and design, and/or client/server systems.
  7. Experience working with diverse, remote, and distributed teams across multiple regions and time zones.
  8. A proven track record as a site reliability or production engineer, and working in a consulting capacity directly with teams, to educate and provide the best solution achievable within the project constraints.
  9. Cyber Security and operations awareness: understanding the basic principles (identity and access management, least privilege, encryption, etc.) and strive towards implementing best practices and education, to establish a robust set of defenses in line with the company requirements.

Contract and locations

Locations: London, Milan, Pisa, Parma

Enjoy a hybrid work culture that offers the best of remote flexibility and in-person collaboration.

Important notes (Italy):

According to the Italian Law (L.68/99) Please note that candidates from the disability list will be given priority.

Due to the high volume of applications, only those candidates that meet the required criteria for selection will be contacted.

If you’re from a non-EU country, you must have a valid EU visa or work permit.

Ottieni la revisione del curriculum gratis e riservata.
oppure trascina qui un file PDF, DOC, DOCX, ODT o PAGES di non oltre 5 MB.