Enable job alerts via email!

Markets SRE Lead (Hybrid)

Citi

United Kingdom

On-site

GBP 100,000 - 125,000

Full time

Yesterday
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

Join a forward-thinking global bank that is revolutionizing the financial services industry through technology. In this pivotal role, you will lead the Site Reliability Engineering initiative within the Production Management team, ensuring operational excellence and the successful adoption of SRE principles. You will work collaboratively with diverse teams to enhance service delivery and drive continuous improvement in a dynamic environment. This position offers the chance to make a significant impact while developing your career in a supportive and innovative setting. Embrace the opportunity to shape the future of global banking with cutting-edge solutions and a culture of growth.

Qualifications

  • Experience in a critical role with high business impact.
  • Excellent understanding of SRE concepts and service levels.
  • Strong experience with end-to-end observability stacks.

Responsibilities

  • Lead daily operations and implement SRE principles in Production Management.
  • Improve service levels and operational efficiencies for end users.
  • Guide development teams on application stability and supportability.

Skills

Site Reliability Engineering (SRE)
Agile methodologies (SCRUM/Kanban)
Docker
Kubernetes
Cloud Formation
Terraform
Networking
Debugging and analytical skills
Middleware technologies (MQ, Apache Kafka)

Education

Bachelor’s degree in computer science/mathematics/physics

Tools

Datadog
AppDynamics
Dynatrace

Job description

Citi, the world leading global bank, has approximately 200 million customer accounts and a presence in more than 160 countries and jurisdictions worldwide. Citi provides consumers, corporations, governments and institutions with a broad range of financial products and services, including consumer banking and credit, corporate and investment banking, securities brokerage, transaction services, and wealth management. Citi enables clients to achieve their strategic financial objectives by providing them with cutting-edge ideas, best-in-class products and solutions, and unparalleled access to capital and liquidity. A critical enabler of Citi’s mission to enable growth and economic progress is technology. Our teams are creating innovations used across the globe – we’re changing the way people bank and how the world does business.

Why Citi? Join a Global Bank that believes in its talent

Due to significant and continued investment, Citi’s technology team is growing. We’re looking for talented, driven individuals to help build the future of global banking. At Citi, you’ll have the chance to build the career you want, with the scale, inclusive culture and technology to support you. You’ll be joining a diverse team of professionals across the world in an environment focused on growth and progress. This is the opportunity to take your career to the next level through the power of Citi’s unmatched globality and vast expertise.

About the Team: ICG Production Management

Our Production Management team provides critical business and technical support to the Institutional Client Group (ICG); working collaboratively to ensure that our platforms and services operate for our clients, whenever they need them. We act as highly skilled and valued partners to our businesses. Working as part of our Production Management team, you will help deliver a world class client service and experience, by applying engineering, innovation, learning and risk management across our systems and user environments. We interact with a diverse range of people each day, collaborating to solve problems as well as to anticipate and remove them before they occur. At Citi we look to raise the bar of operational excellence by using Site Reliability Engineering (SRE) principles to implement continuous improvement across key areas like latency, availability, performance, and capacity. We welcome candidates with SRE mindsets and experience who are keen to promote the adoption of SRE culture at Citi. Citi’s distinct global network of people, data and relationships creates a mindset that allows us to identify opportunities, manage risks and connect dots for our clients in ways that others cannot. Our people really do make all the difference in our success. We’re a forward-thinking team. We’re looking for ambitious, capable professionals who thrive on collaboration and want to improve how things are done. In return, we offer rewarding work in a supportive environment, clear opportunities for progression and exciting company benefits.

About Citi and the Job

Citi has embarked on a multi-year transformation effort to simplify and modernise our bank. As part of the transformation, the Site Reliability Engineering initiative is core to our Production Management Transformation to help our business grow and be more agile and improve the time-to-market. Our mission is to implement and adopt the best practices and industry standards in Site Reliability Engineering and DevOps frameworks to improve our processes and general productivity.

Citi has created a core team to deliver SRE in Production Management and is looking to expand it with an SRE Lead in ICG Markets to implement and adopt SRE Principles and Best Practices in our ICG Production Management organisation.

This role will report to the Head of Production Resiliency and will be based out of London.

The Site Reliability Engineering Lead in Markets will be accountable for leading the daily operations and overall implementation in a complex, critical and large cross-departmental and multi-disciplinary area.

The role is part of a multi-year transformation journey that will require a successful candidate to establish best practices, motivate and promote a cultural shift that will ensure a successful adoption of SRE Principles and Practices.

The role requires a comprehensive understanding of multiple areas within a function and how they interact to achieve the objectives of the function. Applies in-depth understanding of the business impact of technical contributions. Strong commercial awareness is a necessity. Generally accountable for delivery of a full range of services to our Markets business.

Excellent communication skills required to negotiate internally, often at a senior level.

Involved in short- to medium-term planning of actions and resources for own area. Full management responsibility of a team or multiple teams, including management of people, budget and planning.

Responsibilities:

  1. Demonstrates an in-depth understanding of how Site Reliability Engineering integrates within the overall technology function to achieve objectives; requires a good understanding of the industry.
  2. Ability to operate in a global environment with on-/near-/off-shore matrix reporting structures.
  3. Implement and nurture a resilient team structure that will follow-the-sun and ensure it follows the four-eyes process.
  4. Operates in a highly regulated environment that requires in-depth understanding of the regulatory requirements and the industry implications for our technologies.
  5. Improve the service level the team provides to our end users, which includes maximizing operational efficiencies, strengthening incident management, problem management and knowledge sharing practices.
  6. Guide development teams on application stability and supportability improvements.
  7. Formulate and implement a framework for managing capacity, throughput and latency.
  8. Define and implement application onboarding guidelines and standards.
  9. Work with various team members on coaching them on how to maximize their potential, work better in a highly integrated team environment and focus on bringing out their strengths.
  10. Drives continued cost reductions and efficiencies across the portfolios supported by means of Root Cause Analysis reviews, Knowledge management, Performance tuning, and user training.
  11. Participates in business review meetings, relating technology tools strategies to business requirements.
  12. Fosters a culture that promotes transparency and innovation for increased team productivity.
  13. Coaching members of the team and outside the immediate reporting line about the best practices and recognizes anti-patterns that are quickly addressed.
  14. Implements the Agile Framework through one of its implementations like SCRUM or Kanban and ensures it integrates with overall organisation processes.
  15. Avidly communicates progress and project status across the organisation and ensures that stakeholders are managed appropriately throughout the execution period.

Qualifications:

  1. Relevant experience in a critical role with high business impact.
  2. Excellent understanding of SRE concepts (service levels, error budgets, etc.).
  3. Excellent working knowledge of key computer science concepts (networking, operating systems, virtualisation, containerisation, etc.).
  4. Polyglot developer mentality and ability to pick up new languages and skills.
  5. Excellent understanding of Software Engineering concepts like Software Development Life Cycle and GitOps.
  6. Excellent debugging and analytical skills: ability to isolate root cause across networking/infrastructure, application and database stacks.
  7. Operational experience of deploying and running services at scale on top of Docker/Kubernetes stack and a service mesh, like Istio, is a must.
  8. Operational experience with orchestration tools for CI/CD and Infrastructure-as-Code tooling (Terraform, Cloud Formation, etc.) is a must.
  9. Experience of delivering software using Agile delivery methodologies is a must (SCRUM/Kanban).
  10. Operational experience of using middleware technologies (MQ, Apache Kafka, etc.) to run services at scale is highly desirable.
  11. Strong experience with end-to-end observability stacks (Datadog, AppDynamics, Dynatrace, etc.) is highly desirable.
  12. Degree in computer science/mathematics/physics or related technical subject is highly desirable.
  13. Experience of senior stakeholder management.
  14. Consistently demonstrates clear and concise written and verbal communication skills.

Education:

  1. Bachelor’s/University degree in computer science/mathematics/physics or related technical subject is highly desirable.

#SREICGPM

Job Family Group: Technology

Job Family: Applications Support

Time Type: Full time

Citi is an equal opportunity and affirmative action employer. Qualified applicants will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran. Citigroup Inc. and its subsidiaries ("Citi") invite all qualified interested applicants to apply for career opportunities. If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity review Accessibility at Citi.

View the "EEO is the Law" poster. View the EEO is the Law Supplement. View the EEO Policy Statement. View the Pay Transparency Posting.

While we're a global bank, our mission is simple: We responsibly provide financial services that enable growth and economic progress. We strive to earn and maintain the public's trust by constantly adhering to the highest ethical standards. We ask our colleagues to ensure that their decisions pass three tests: they are in our clients' interests, create economic value, and are always systemically responsible. When we do these things well, we make a positive financial and social impact in the communities we serve and show what a global bank can do.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.