Observability Capacity SRE Engineer (East Coast, FULLY REMOTE)

Lensa

Washington (District of Columbia)

Remote

USD 100,000 - 130,000

Full time

Yesterday

Job summary

A leading tech company is seeking an Observability Capacity SRE Engineer to ensure the performance and scalability of its observability products. This fully remote role, vital to operational success, requires strong collaboration skills and a background in cloud-native infrastructure. Candidates will work on projects that affect customer satisfaction and operational excellence, supporting a large architecture with a focus on continuous improvement.

Qualifications

  • 3-5 years of experience in software engineering, DevOps, or SRE.
  • Working knowledge of CI/CD pipelines and observability tools.
  • Experience in scaling cloud services and troubleshooting.
  • Ability to collaborate across teams.

Responsibilities

  • Triage and resolve quota and capacity requests for customers.
  • Maintain stability and scalability of a distributed infrastructure.
  • Monitor platform usage to ensure availability for customer growth.
  • Collaborate with engineering to address performance issues.

Skills

Cloud-native infrastructure
Distributed microservice architectures
Strong debugging skills
Systems thinking skills
Collaboration skills

Tools

Jira
Splunk dashboards

Job description

Observability Capacity SRE Engineer (East Coast, FULLY REMOTE)

2 days ago

Lensa is a career site that helps job seekers find great jobs in the US. We are not a staffing firm or agency. Lensa does not hire directly for these jobs, but promotes jobs on LinkedIn on behalf of its direct clients, recruitment ad agencies, and marketing partners. Lensa partners with DirectEmployers to promote this job for Cisco.

About Us

Splunk Cloud Operations is where critical thinking meets real-world impact. Our teams operate at the heart of the Splunk Platform, ensuring stability, performance, and continuous improvement across a massive cloud ecosystem. We work at speed, across boundaries, and with purpose — solving sophisticated, large-scale problems that directly affect customers worldwide.

The Opportunity

This isn’t your average support or SRE role. As an Observability Capacity Engineer, you’ll play a strategic role in ensuring Splunk’s Observability products scale effectively and serve customers reliably. You’ll operate at the intersection of systems engineering, platform operations, and tooling, making data-driven decisions to optimize how capacity is provisioned and how services run across a distributed architecture.

This is a high-impact role for an engineer who enjoys digging into infrastructure puzzles, building smarter systems, and acting as a connective force between Engineering, Support, and Product.

What You’ll Do

  • Triage and resolve inbound quota and capacity requests for Observability customers.
  • Fine-tune backend configurations to match customer traffic patterns and platform load.
  • Maintain stability and scalability of a shared, distributed infrastructure supporting hundreds of tenants.
  • Monitor platform usage and proactively ensure capacity is available for customer growth.
  • Collaborate with Engineering teams to identify and resolve critical performance or availability issues.
  • Define requirements and advocate for tooling improvements that reduce manual effort and speed up delivery.
  • Use your engineering mindset to drive continuous process and system optimization.
  • Support the broader Fulfillment Operations team by building and maintaining business-critical Splunk dashboards.

What You Bring

  • 3–5 years of experience in software engineering, DevOps, SRE, or platform operations roles.
  • Working knowledge of cloud-native infrastructure, distributed microservice architectures, and CI/CD pipelines.
  • Strong debugging and systems thinking skills — you can connect symptoms to root causes across layers.
  • Proficiency with the command line; hands-on experience with Jira or similar systems.
  • Familiarity with observability tools (e.g., metrics, logging, tracing platforms).
  • Comfortable balancing tactical execution with strategic thinking — you enjoy both shipping and shaping.
  • Strong collaboration skills and the ability to partner effectively with engineering, support, and product teams.
  • Hands-on experience building and optimizing Splunk dashboards.
  • Bonus: experience with entitlement systems or Salesforce.

Why Join Us

  • You’ll work on systems at scale with tangible customer impact.
  • You’ll gain deep exposure to observability tooling, platform architecture, and operational strategy.
  • You’ll influence how we build, automate, and evolve capacity workflows across the company.
  • You’ll be part of a team that values autonomy, critical thinking, and cross-functional collaboration.

Splunk, a Cisco company, is an Equal Opportunity Employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, gender, sexual orientation, national origin, genetic information, age, disability, veteran status, or any other legally protected basis.

If you have questions about this posting, please contact support@lensa.com

  • Seniority level: Mid-Senior level
  • Employment type: Full-time
  • Job function: Engineering and Information Technology
  • Industries: IT Services and IT Consulting
