Enable job alerts via email!

Site Reliability Engineer 4- RS1013439

Juniper Networks, Inc

Westford (MA)

On-site

USD 107,000 - 154,000

Full time

13 days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

Juniper Networks is seeking a full-time Site Reliability Engineer to join their team. The role involves maintaining and improving the production environment for cloud services, ensuring high availability and performance. Candidates should have a strong background in DevOps/SRE, particularly with AWS and GCP, and be able to manage incident lifecycles effectively.

Benefits

Medical benefits
401(k)
Vacation
Sick leave
Parental leave

Qualifications

  • Minimum 5 years of DevOps/SRE experience.
  • At least 3 years working with AWS and/or GCP.
  • Understanding of distributed systems and data management technologies.

Responsibilities

  • Manage system availability and service levels of cloud infrastructure.
  • Proactively monitor and diagnose failures in production.
  • Participate in on-call rotations to resolve issues.

Skills

AWS
GCP
Linux
Shell Scripting
Kubernetes
Jenkins
Prometheus
CloudWatch

Education

Bachelor's degree in Computer Science

Tools

Terraform
CloudFormation

Job description

Site Reliability Engineer 4

Location: REMOTE / anywhere in the U.S

Juniper is seeking a full-time SRE to join our talented team and support high-quality technology solutions that revolutionize wireless and wired networks, powered by Artificial Intelligence in the cloud. We provide services through SaaS applications to several enterprises, including Fortune 100 and Fortune 500 customers. You will be responsible for maintaining and improving the company's production environment for rapid scaling and outstanding performance, ensuring stellar cloud uptime and reliability. Your primary responsibilities will include incident management and release management across cloud instances in various regions.

About Juniper

Juniper is changing what's possible in networking. We're building networks that exceed customer expectations. To continue excelling, we seek radical thinkers, eternal optimists, and energized personalities. We foster a culture of innovation, support chances to grow ideas, and are led by thoughtful, inclusive leaders. Join us to be part of a transformative journey in networking.

Our Values

We aim to deliver network experiences that transform how people connect, work, and live, guided by our core values: Being Bold, Building Trust, and Delivering Excellence.

Join Us

Do you want to solve complex problems, build systems that will change the Internet, and work with a world-class engineering team? If so, Juniper offers an exciting opportunity for you.

Responsibilities
  • Manage system availability, health, and service levels (SLAs, SLOs) of large-scale cloud infrastructure in AWS and GCP.
  • Proactively monitor, diagnose, analyze failures, and support software engineers in debugging production issues across microservices and distributed platforms.
  • Participate in on-call rotations and resolve issues in a 24x7 multi-cloud environment.
  • Monitor application and infrastructure metrics and performance.
  • Manage code releases, including pushing code and patches to the cloud.
  • Own the incident lifecycle, including reporting, analysis, handling, closure, and writing RCAs.
  • Focus on scalability, reliability, high availability, performance, maintainability, and operational challenges.
  • Write and maintain runbooks for automated processes and bots.
  • Perform capacity planning based on performance and utilization data.
  • Conduct after-hours infrastructure updates and maintenance.
  • Follow SRE best practices and procedures.
Basic Qualifications
  • Bachelor's degree in Computer Science, Computer Engineering, or equivalent.
  • Minimum 5 years of DevOps/SRE experience.
  • At least 3 years working with AWS and/or GCP.
  • Technical experience with EC2 (GCE), IAM, S3 (GS), Kubernetes, Jenkins, Prometheus, CloudWatch (Stackdriver), Linux, and Shell Scripting.
  • Preferred: Basic understanding of Terraform, CloudFormation, or other IaC tools.
  • Understanding of distributed systems and data management technologies, including relational and non-relational databases.
  • Experience operating large-scale cloud-based distributed applications.
  • Ability to manage in-flight issues effectively.
Additional Information

Salary Range: $107,008.00 - $153,824.00 annually. Compensation includes medical benefits, 401(k), vacation, sick leave, and parental leave. The actual salary may vary based on location, experience, and other factors. This is an at-will employment position with the right to modify compensation and benefits at any time.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Site Reliability Engineer 4- RS1013439

Juniper Networks

Westford

Remote

USD 120,000 - 160,000

13 days ago

Site Reliability Engineer

NESN

Watertown

Remote

USD 120,000 - 160,000

20 days ago

[Hiring] Site Reliability Engineer @Keeper Security, Inc.

Keeper Security, Inc.

Remote

USD 130,000 - 180,000

Today
Be an early applicant

Site Reliability Engineer (IFE DCC)

Thales

Remote

USD 90,000 - 130,000

Today
Be an early applicant

Tech Ops-Site Reliability Engineer - 30264

Splunk Inc

Connecticut

Remote

USD 90,000 - 120,000

Today
Be an early applicant

Lead Site Reliability Engineer - Remote

Davita Inc.

San Antonio

Remote

USD 106,000 - 195,000

Today
Be an early applicant

Lead Site Reliability Engineer - ITIL/ITSM

Visionyle Solutions

Remote

USD 100,000 - 130,000

Today
Be an early applicant

Site Reliability Engineer New United States-Remote

Onestudyteam

Remote

USD 100,000 - 140,000

Today
Be an early applicant

HPC Site Reliability Engineer (SRE) Engineering US, Remote Working

ORI

Remote

USD 120,000 - 160,000

Today
Be an early applicant