Compute Grid Site Reliability Engineer-AVP

Barclays UK

Singapore

On-site

USD 70,000 - 110,000

Full time

30+ days ago

Job summary

An established industry player is seeking a Compute Grid Site Reliability Engineer to enhance their distributed super-computer platform. This role involves improving service reliability, availability, and scalability while working within an EngOps framework. You will engage in operations, engineering, and development tasks, contributing to a culture of technical excellence. If you're passionate about technology and eager to make a significant impact in a collaborative environment, this position offers a unique opportunity to drive innovation and efficiency in a critical area of the business.

Qualifications

  • Strong technical aptitude with experience in systems administration and scripting.
  • Ability to automate processes and improve system performance.

Responsibilities

  • Ensure system reliability and scalability through proactive monitoring.
  • Develop tools to automate operational processes and enhance efficiency.

Skills

Verbal and written communication skills
Problem-solving skills
Windows/Unix Systems Administration
PowerShell scripting
Python scripting

Tools

IBM Symphony
Tibco/DataSynapse GridServer
Microsoft Azure
AWS
Splunk
Git
Chef
Jenkins
Terraform

Job description

Join us as a Compute Grid Site Reliability Engineer - AVP in Singapore. The Compute Grid team is responsible for building and maintaining the bank’s distributed super-computer, which runs the bank’s compute-intensive workloads. The system harnesses CPU capacity sourced from on-premises infrastructure and the public cloud. The team’s mission statement is: “To provide a stable platform for the distributed execution of computation tasks at the lowest possible price”. In this role, you will work to continuously improve the Compute Grid service, operating within the team’s EngOps framework (a mix of SRE & DevOps) and taking part in support, operations, engineering, and development work on rotation.

Essential Skills/Basic Qualifications

  • Strong verbal and written communication skills.
  • Strong technical aptitude and can-do attitude along with good problem-solving skills.
  • Experience in Windows/Unix Systems Administration.
  • PowerShell and Python scripting.

Desirable skills/Preferred Qualifications

  • Experience with High Performance Computing software such as IBM Symphony and Tibco/DataSynapse GridServer.
  • Experience with Microsoft Azure & AWS.
  • Experience using Splunk.
  • Experience in DevOps tooling (Git, Chef, Jenkins, Terraform).

Purpose of the role

To apply software engineering techniques, automation, and incident-response best practices to ensure the reliability, availability, and scalability of systems, platforms, and technology.

Accountabilities

  • Availability, performance, and scalability of systems and services through proactive monitoring, maintenance, and capacity planning.
  • Resolution, analysis and response to system outages and disruptions, and implementation of measures to prevent similar incidents from recurring.
  • Development of tools and scripts to automate operational processes, reducing manual workload, increasing efficiency, and improving system resilience.
  • Monitoring and optimisation of system performance and resource usage, identification and resolution of bottlenecks, and implementation of best practices for performance tuning.
  • Collaboration with development teams to integrate best practices for reliability, scalability, and performance into the software development lifecycle, and close cooperation with other teams to ensure smooth and efficient operations.
  • Staying informed of industry technology trends and innovations, and actively contributing to the organisation's technology communities to foster a culture of technical excellence and growth.

Assistant Vice President Expectations

  • Advise and influence decision making, contribute to policy development and take responsibility for operational effectiveness. Collaborate closely with other functions/business divisions.
  • Lead a team performing complex tasks, using well developed professional knowledge and skills to deliver on work that impacts the whole business function. Set objectives and coach employees in pursuit of those objectives, appraise performance relative to objectives, and determine reward outcomes.
  • If the position has leadership responsibilities, People Leaders are expected to demonstrate a clear set of leadership behaviours to create an environment for colleagues to thrive and deliver to a consistently excellent standard. The four LEAD behaviours are: L – Listen and be authentic, E – Energise and inspire, A – Align across the enterprise, D – Develop others.
  • Individual contributors are instead expected to lead collaborative assignments and guide team members through structured assignments, identifying the need for the inclusion of other areas of specialisation to complete assignments. They will identify new directions for assignments and/or projects, identifying a combination of cross functional methodologies or practices to meet required outcomes.
  • Consult on complex issues, providing advice to People Leaders to support the resolution of escalated issues.
  • Identify ways to mitigate risk and develop new policies/procedures in support of the control and governance agenda.
  • Take ownership for managing risk and strengthening controls in relation to the work done.
  • Perform work that is closely related to that of other areas, which requires understanding of how areas coordinate and contribute to the achievement of the objectives of the organisation sub-function.
  • Collaborate with other areas of work; for business-aligned support areas, keep up to speed with business activity and the business strategy.
  • Engage in complex analysis of data from multiple internal and external sources of information, such as procedures and practices (in other areas, teams, companies, etc.), to solve problems creatively and effectively.
  • Communicate complex information. 'Complex' information could include sensitive information or information that is difficult to communicate because of its content or its audience.
  • Influence or convince stakeholders to achieve outcomes.

All colleagues will be expected to demonstrate the Barclays Values of Respect, Integrity, Service, Excellence and Stewardship – our moral compass, helping us do what we believe is right. They will also be expected to demonstrate the Barclays Mindset – to Empower, Challenge and Drive – the operating manual for how we behave.
