Enable job alerts via email!

Site Reliability Engineer- Remote

Cisco

San Francisco (CA)

Remote

USD 90,000 - 150,000

Full time

8 days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An innovative tech firm is seeking a Site Reliability Engineer to join their dynamic team. This role involves managing the FedRAMP Cloud environment, ensuring service quality, and collaborating with cross-functional teams. You will leverage your expertise in DevOps and GitOps to enhance operational processes and automate workflows. The ideal candidate will have a strong understanding of cloud infrastructure, Linux internals, and monitoring tools. This position offers an exciting opportunity to contribute to cutting-edge projects in a fast-paced environment, making a significant impact on the company's success.

Benefits

Comprehensive health insurance

401(k) plan with company matching

Disability coverage

Life insurance

Paid holidays

Paid time off (PTO)

Volunteer days

Stock purchase options

Qualifications

Experience with AWS and managing cloud environments.
Strong background in DevOps practices and GitOps methodologies.

Responsibilities

Manage and enhance Cisco Spaces FedRAMP GovCloud environment.
Develop automated scripts to improve operational efficiency.
Ensure high availability and reliability through proactive monitoring.

Skills

DevOps

GitOps

Linux OS Internals

Cloud Infrastructure

Automation

Monitoring Tools (Prometheus, Grafana, ELK Stack)

Containerization (Kubernetes)

CI/CD Pipelines

Education

Bachelor's Degree in Computer Science or related field

Relevant certifications (AWS, DevOps)

Tools

AWS

Kubernetes

Jenkins

Git

GitLab

Applications are accepted until further notice

Who we are

Cisco Spaces is an industry-leading indoor location as a service solution that provides insights into the behavior of end-user devices and network-connected objects in any location with wireless connectivity. It enables customers to make informed business decisions, optimize operations, and enhance experiences. Cisco Spaces integrates multiple location-based services into a unified platform and is designed as a zero-touch SaaS Cloud product.

The TechOps team excels in leveraging the scalability and flexibility of Cloud Infrastructure to respond swiftly to changing business needs, operate proactively, and deliver services at scale with high SLA.

The team’s primary focus is to improve system operations through automation, maintaining predefined SLAs, and minimizing costs. CloudOps develops processes to measure system effectiveness and identify improvements, and is responsible for refining software tools and dashboards within the cloud environment.

CloudOps also monitors the health and security of the cloud environment 24/7/365, within Cisco's framework, and automates information gathering on current specifications. The team stays informed about new technologies and recommends solutions to management, oversees orchestration tooling, and conducts compliance audits and reporting.

What you’ll do

As a Site Reliability Engineer, you will utilize various tools and integrations to deliver foundational services vital to Cisco's business functions through GitOps. We seek an engineer proficient in DevOps and GitOps to lead a flexible, multi-skilled team responsible for maintaining the Cisco Spaces FedRAMP environment. You will operate, maintain, and enhance all aspects of the FedRAMP Cloud and serve as the point of contact for the Bangalore CloudOps team. The FedRAMP standards follow NIST 800-53 control policies, with regular audits to ensure compliance.

Responsibilities

Manage Cisco Spaces FedRAMP GovCloud (hosted in AWS) environment.
Ensure quality, performance, robustness, and scalability of services; perform bug fixes and issue triaging.
Collaborate with the CloudOps Bangalore team.
Evaluate OS, application hotfixes, patches, releases, and deployments.
Proactively engage with or create cross-functional teams to solve problems or add value.
Develop and present technical strategies and ideas for feedback.
Influence others to support or implement ideas and strategies through collaboration.
Resolve issues and set SLOs, with monitoring and logging for feature measurement.
Ensure high availability and reliability through proactive monitoring and timely resolution.
Develop automated scripts and tools to reduce manual workload and improve efficiency.
Deep understanding of Linux OS internals to optimize performance and troubleshoot.
Implement comprehensive monitoring using tools like Prometheus, Grafana, or ELK Stack.
Configure load balancing, SSL certificates, and manage HTTP/HTTPS traffic, especially in AWS environments.
Leverage cloud-native services for building resilient, scalable applications.
Deploy and manage containerized applications using Kubernetes on AWS EKS.
Design and implement CI/CD pipelines with Jenkins, Git, or GitLab.
Provide on-call support.

Message to applicants applying to work in the U.S. and/or Canada: The posted salary range reflects the projected hiring range for new full-time hires in these locations, excluding benefits and equity. Salaries depend on location, skills, experience, and education. The recruiter can share more details during the hiring process.

U.S. employees have access to comprehensive health insurance, a 401(k) plan with Cisco matching, disability coverage, life insurance, paid holidays, PTO, volunteer days, and stock purchase options. Incentive pay for sales roles is based on performance and attainment levels, with specific rates and caps described during hiring. Cisco is an Equal Opportunity Employer and considers qualified applicants regardless of protected characteristics. We also consider applicants with arrest and conviction records on a case-by-case basis.

Get your free, confidential resume review.

or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

3 days ago

Be an early applicant

Staff Software Engineer, Reliability Engineer - Store Systems & Services (Remote)

Lensa

Atlanta

Remote

USD 120,000 - 190,000

Yesterday

Be an early applicant

Site Reliability Engineer- Remote

Cisco

San Francisco (CA)

Remote

USD 90,000 - 150,000

Full time

Job summary

Benefits

Qualifications

Responsibilities

Skills

Education

Tools

Job description

Similar jobs

Site Reliability Engineer (SRE)

San Francisco

Remote

USD 90,000 - 150,000