Technical Program Manager - Data Ops/SRE

Lam Research

California, Fremont (MO, CA)

On-site

USD 90,000 - 140,000

Full time

4 days ago

Job summary

An innovative firm is seeking a seasoned Site Reliability Engineer to enhance the reliability of their enterprise data platform. This role involves collaborating with cross-functional teams to implement best practices in data operations, cloud computing, and observability enhancements. You will be at the forefront of assessing current capabilities and driving future transformations, ensuring the platform's resilience and efficiency. If you have a passion for technology and a track record in SRE, this is an exciting opportunity to make a significant impact in a dynamic environment.

Qualifications

  • 8-10 years of experience in relevant fields with a Bachelor's degree.
  • 5+ years as an SRE/Data operations lead for enterprise data platforms.

Responsibilities

  • Evangelize SRE mindset across data platform teams.
  • Collaborate with Engineering and Architecture teams for future state architecture.

Skills

Site Reliability Engineering (SRE)
Data Operations
Cloud Computing
Troubleshooting
Root Cause Analysis
Database Query Development
Web Application Programming

Education

Bachelor's degree in Engineering

Tools

MS Azure
ServiceNow

Job description

What You'll Do

  • Evangelize SRE mindset across data platform teams, including mapping SLOs, SLAs, and SLIs for data services.
  • Assess current platform capabilities, use cases, and future transformation requirements to benchmark data operations reliability.
  • Collaborate with Engineering and Architecture teams, as well as internal and external stakeholders, to map the future state architecture for data platform and services reliability, including SaaS, PaaS, and IaaS components.
  • Assess current tooling and observability capabilities; develop and implement a roadmap for future observability enhancements.
  • Support disaster recovery planning and testing in partnership with infrastructure, application teams, and domain architects.
  • Assist in troubleshooting operational issues, leading root cause analysis and corrective actions.

Who We're Looking For

Education: Bachelor's degree in Engineering or a relevant field, typically with 8-10 years of experience.

  • 5+ years as an SRE / Data operations lead for an enterprise data platform and MS Azure data services.
  • 3+ years in web application programming (Java, .NET, JSP, JavaScript, Servlet).
  • 3+ years in database query development and design, specifically SQL Server.
  • 3+ years of experience in cloud computing and automation.
  • 3+ years of experience in ITSM, with exposure to tools such as ServiceNow.
  • Experience with Big Data platform reliability is highly preferred.

Our Commitment

We believe every individual should feel valued, included, and empowered to reach their full potential. Bringing diverse perspectives leads to extraordinary results.

Lam Research (
