Site Reliability Engineer 4 - RS1013439

Juniper Networks, Inc

Westford (MA)

On-site

USD 107,000 - 154,000

Full time

13 days ago

Job summary

Juniper Networks is seeking a full-time Site Reliability Engineer to join their team. The role involves maintaining and improving the production environment for cloud services, ensuring high availability and performance. Candidates should have a strong background in DevOps/SRE, particularly with AWS and GCP, and be able to manage incident lifecycles effectively.

Benefits

Medical benefits

401(k)

Vacation

Sick leave

Parental leave

Qualifications

Minimum 5 years of DevOps/SRE experience.
At least 3 years working with AWS and/or GCP.
Understanding of distributed systems and data management technologies.

Responsibilities

Manage system availability and service levels of cloud infrastructure.
Proactively monitor and diagnose failures in production.
Participate in on-call rotations to resolve issues.

Skills

AWS

GCP

Linux

Shell Scripting

Kubernetes

Jenkins

Prometheus

CloudWatch

Education

Bachelor's degree in Computer Science

Tools

Terraform

CloudFormation

Site Reliability Engineer 4

Location: REMOTE / anywhere in the U.S.

Juniper is seeking a full-time SRE to join our talented team and support high-quality technology solutions that revolutionize wireless and wired networks, powered by Artificial Intelligence in the cloud. We provide services through SaaS applications to several enterprises, including Fortune 100 and Fortune 500 customers. You will be responsible for maintaining and improving the company's production environment for rapid scaling and outstanding performance, ensuring stellar cloud uptime and reliability. Your primary responsibilities will include incident management and release management across cloud instances in various regions.

About Juniper

Juniper is changing what's possible in networking. We're building networks that exceed customer expectations. To continue excelling, we seek radical thinkers, eternal optimists, and energized personalities. We foster a culture of innovation, create opportunities to develop ideas, and are led by thoughtful, inclusive leaders. Join us to be part of a transformative journey in networking.

Our Values

We aim to deliver network experiences that transform how people connect, work, and live, guided by our core values: Being Bold, Building Trust, and Delivering Excellence.

Join Us

Do you want to solve complex problems, build systems that will change the Internet, and work with a world-class engineering team? If so, Juniper offers an exciting opportunity for you.

Responsibilities

Manage system availability, health, and service levels (SLAs, SLOs) of large-scale cloud infrastructure in AWS and GCP.
Proactively monitor, diagnose, and analyze failures, and support software engineers in debugging production issues across microservices and distributed platforms.
Participate in on-call rotations and resolve issues in a 24x7 multi-cloud environment.
Monitor application and infrastructure metrics and performance.
Manage code releases, including pushing code and patches to the cloud.
Own the incident lifecycle, including reporting, analysis, handling, closure, and writing RCAs.
Focus on scalability, reliability, high availability, performance, maintainability, and operational challenges.
Write and maintain runbooks for automated processes and bots.
Perform capacity planning based on performance and utilization data.
Conduct after-hours infrastructure updates and maintenance.
Follow SRE best practices and procedures.

Basic Qualifications

Bachelor's degree in Computer Science, Computer Engineering, or equivalent.
Minimum 5 years of DevOps/SRE experience.
At least 3 years working with AWS and/or GCP.
Technical experience with EC2 (GCE), IAM, S3 (GCS), Kubernetes, Jenkins, Prometheus, CloudWatch (Stackdriver), Linux, and Shell Scripting.
Preferred: Basic understanding of Terraform, CloudFormation, or other IaC tools.
Understanding of distributed systems and data management technologies, including relational and non-relational databases.
Experience operating large-scale cloud-based distributed applications.
Ability to manage in-flight issues effectively.

Additional Information

Salary Range: $107,008.00 - $153,824.00 annually. Compensation includes medical benefits, 401(k), vacation, sick leave, and parental leave. The actual salary may vary based on location, experience, and other factors. This is an at-will position, and the company reserves the right to modify compensation and benefits at any time.
