Enable job alerts via email!

Operations Site Reliability Engineer

Broadcom Inc.

West of England

On-site

GBP 55,000 - 75,000

Full time

Yesterday
Be an early applicant

Job summary

A leading semiconductor company located in the West of England is seeking an experienced Operations Site Reliability Engineer. In this role, you will monitor production services, respond to stakeholder requests, and drive automation efforts to enhance system performance. The ideal candidate will possess extensive Linux administration experience and a background in cloud platforms. The role offers a competitive salary and various benefits including a bonus scheme and medical insurance.

Benefits

Highly competitive salary
Generous bonus scheme
Equity package
Competitive company pension
Employee stock purchase plan (ESPP)
Private Medical Insurance
Life Assurance scheme
On-site parking

Qualifications

  • 5+ years of experience in administering Linux systems.
  • Strong hands-on experience with multiple Linux distros.
  • 2+ years operational experience with cloud platforms.

Responsibilities

  • Monitor and ensure availability of production services.
  • Automate tasks to enhance application performance.
  • Coordinate incident management processes.

Skills

Linux systems administration
Automation platforms
AWS or Google Cloud Platform
Ansible
Docker containers
Networking knowledge
Scripting languages (Perl, shell, Ruby, Python)
Problem-solving

Education

Degree in Systems Engineering or Computer Science

Tools

Ansible Tower
Terraform
Jenkins

Job description

Operations Site Reliability Engineer page is loaded

Operations Site Reliability Engineer
Apply locations United Kingdom-Bristol-Almondsbury-Hempton Court time type Full time posted on Posted 30+ Days Ago job requisition id R022662

Please Note:

1. If you are a first time user, please create your candidatelogin account before you apply for a job. (Click Sign In > Create Account)

2. If you already have a Candidate Account, please Sign-In before you apply.

Job Description:

The primary responsibilities include:


· To form part of a critical operations function that is responsible for the monitoring, availability and performance of production services.

· Responding to stakeholder requests within agreed timescales or SLO

· Drive automation to reduce failures, manual tasks and therefore improving overall application performance and availability.

· Perform systems administration activities to ensure the smooth operation of applications across multiple platforms

· Coordinate and communicate with impacted stakeholders as per incident management process.

· Demonstrate ownership of events and incidents through to restoration

· Perform daily shift handovers to peers and management across multiple geographies.

· Support maintenance activities which impact production applications.

· Support critical systems that handle sensitive and proprietary data

· Create, maintain and update work instructions for troubleshooting and supporting applications.

· Contribute to the planning of application/infrastructure releases and configuration changes

· Provide input to administering and maintaining all production environments

· Patching and upgrade of existing applications

· Provide feedback and coaching to upstream teams (both internal and vendors) to reduce escalations and to continually improve overall experience for customers.

Professional Experience Required

  • A degree in Systems Engineering, Computer Science or related fields with related experience preferred
  • 5+ years of experience administering Linux systems
  • Strong hands-on experience of variants of linux distros
  • 2+ years Operational experience of working with Amazon Web Services or Google Cloud Platform
  • Experience of working with an automation platform to automate repetitive actions that reduce manual effort
  • Familiarity with deployment tools such as Ansible Tower and Jenkins
  • Experience in carrying out large deployments to global infrastructure
  • Proficient with orchestration/configuration tools such as Ansible and Terraform
  • Strong working knowledge of networking, packet tracing, understanding latency and throughput in order to pinpoint or resolve application issues.
  • Thorough knowledge of HTTP(S), SMTP, TLS/SSL, DNS, LDAP, Kubernetes and Docker containers
  • Experience of system/application administration in a distributed, customer-facing, high-availability and large-scale environments
  • Experienced and confident in at least one scripting language such as Perl, shell, Ruby or Python.
  • Experience of tuning and optimising monitoring systems.

Personal Experience Required

  • A strong team player with the ability to grasp new technologies, adapt to change in methodologies, with a focus on delivery
  • Extensive troubleshooting and problem-solving skills with respect to application technologies
  • Ability to remain calm and work well under pressure
  • A keen interest and desire to work within the security arena
  • Ability to communicate effectively at all levels up to senior management.

Benefits

  • Highly competitive salary
  • Generous bonus scheme
  • Equity package
  • Competitive company pension
  • Employee stock purchase plan (ESPP)
  • Private Medical Insurance (Individual or family)
  • Life Assurance scheme (up to 4x salary)
  • Ample on-site parking.

This role will need to participate in weekends and holidays on-call support as and when required.

Broadcom is proud to be an equal opportunity employer. We will consider qualified applicants without regard to race, color, creed, religion, sex, sexual orientation, gender identity, national origin, citizenship, disability status, medical condition, pregnancy, protected veteran status or any other characteristic protected by federal, state, or local law. We will also consider qualified applicants with arrest and conviction records consistent with local law.

If you are located outside USA, please be sure to fill out a home address as this will be used for future correspondence.

Welcome! Thank you for your interest in Broadcom!

We are a global technology leader that designs, develops and supplies a broad range of semiconductor and infrastructure software solutions.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.