Staff Software Engineer, Reliability Engineer - Store Systems & Services (Remote)

Lensa

Atlanta (GA)

Remote

USD 120,000 - 190,000

Full time

3 days ago

Job summary

Join a forward-thinking company as a Staff Reliability Engineer, where you will lead a dynamic team focused on ensuring the reliability and performance of critical systems. This role offers the opportunity to develop and maintain software solutions while mentoring junior engineers. You will engage in tool selection, performance tuning, and production monitoring, contributing to foundational infrastructure elements. If you are passionate about reliability engineering and want to make a significant impact, this position is perfect for you. Embrace the challenge of working in a collaborative environment that fosters growth and innovation.

Qualifications

  • 3-5 years of relevant work experience in software engineering.
  • Experience with infrastructure automation tools and cloud platforms.

Responsibilities

  • Develops, tests, and maintains software for system reliability.
  • Leads and mentors junior engineers in software development practices.

Skills

Infrastructure Automation
Cloud Solutions Architecture
Monitoring Tools
Reliability Engineering Principles
Debugging Techniques
Version Control Systems
Security Frameworks

Education

Bachelor's Degree in a related field

Tools

Terraform
Ansible
Chef
Google Cloud
AWS
Prometheus
Grafana
Datadog

Job description

Lensa is the leading career site for job seekers at every stage of their career. Our client, Home Depot, is seeking professionals. Apply via Lensa today!

Position Purpose

The Staff Reliability Engineer is responsible for leading a team of engineers focused on ensuring the reliability, availability, and performance of our systems and applications. As a Staff Reliability Engineer, you will be part of a dynamic team with engineers of all experience levels who help each other build and grow technical and leadership skills while creating, deploying, and supporting production systems. In addition, Staff Reliability Engineers will assist in tool selection, configuration, security, resilience, performance tuning, and production monitoring.

Staff Reliability Engineers contribute to foundational infrastructure elements that can be reused as well as architectural diagrams and other system-related documentation. As a Staff Reliability Engineer, you will be a core player on the reliability team and are expected to build and grow the skillsets of the more junior Engineers.

Key Responsibilities

  • 50% Delivery and Execution - Develops, tests, deploys, and maintains software, with a clear understanding of the value the software is to provide; Takes a broad view when approaching issues, using a global lens; Consistently achieves results, even under tough circumstances; Develops test suites (functional, destructive, etc.) to enable successful, rapid deployment of code to production; Takes on new opportunities and tough challenges with a sense of urgency, high energy, and enthusiasm
  • 10% Learns and Grows - Actively seeks ways to grow and be challenged using both formal and informal development channels; Learns through successful and failed experiments when tackling new problems
  • 20% Plans and Aligns - Creates new and better ways for the organization to be successful; Delivers multi-mode communications that convey a clear understanding of the unique needs of different audiences; Works with the Product Team to ensure user stories are developer-ready, easy to understand, and testable; Collaborates with other team members in agile processes; Relates openly and comfortably with diverse groups of people; Adapts approach and demeanor in real time to match the shifting demands of different situations
  • 20% Supports and Enables - Fields questions from product and engineering teams; Helps grow junior engineers by providing guidance on modern software development frameworks and leading technical discussions; Notes gaps on the team and provides suggestions for changes to make the team more productive

Direct Manager/Direct Reports

  • This position typically reports to Software Engineer Manager or Sr. Manager
  • This position typically has 0 Direct Reports

Travel Requirements

  • No travel required.

Physical Requirements

  • Most of the time is spent sitting in a comfortable position and there is frequent opportunity to move about. On rare occasions there may be a need to move or lift light articles.

Working Conditions

  • Located in a comfortable indoor area. Any unpleasant conditions would be infrequent and not objectionable.

Minimum Qualifications

  • Must be eighteen years of age or older.
  • Must be legally permitted to work in the United States.

Preferred Qualifications

  • 3-5 years of relevant work experience
  • Extensive experience with infrastructure automation tools such as Terraform, Ansible, or Chef
  • Experience architecting solutions in Google Cloud, AWS, or similar cloud platforms
  • Experience with monitoring and observability tools like Prometheus, Grafana, or Datadog
  • Extensive experience and competence in core Reliability Engineering Principles and Practices
  • Familiarity with both Unix and Windows operating systems
  • Experience with security frameworks for user and services authorization and authentication
  • Experience in creating and executing unit, functional, destructive, and performance tests
  • Experience with modern debugging and root cause analysis techniques in distributed systems
  • Experience with version control systems
  • Experience in designing systems for High Availability, Disaster Recovery, Performance, Efficiency, and Security
  • Exposure to developing technical roadmaps, including work estimation, refactoring, and modernizing legacy systems
  • Operational support experience with a focus on system reliability
  • Ability to lead teams and share knowledge across engineering functions
  • Experience creating Standard Operating Procedures (SOPs) and collaborating with Principal Engineering to maintain operational excellence
  • Strong communication and collaboration skills with staff engineers and business partners, particularly around fostering stability and continuous improvement

Minimum Education

  • The knowledge, skills and abilities typically acquired through the completion of a bachelor's degree program or equivalent degree in a field of study related to the job.

Preferred Education

  • No additional education

Minimum Years Of Work Experience

  • 3

Preferred Years Of Work Experience

  • No additional years of experience

Minimum Leadership Experience

  • None

Preferred Leadership Experience

  • None

Certifications

  • None

Competencies

  • Global Perspective
  • Manages Ambiguity
  • Nimble Learning
  • Self-Development
  • Collaborates
  • Cultivates Innovation
  • Situational Adaptability
  • Communicates Effectively
  • Drives Results
  • Interpersonal Savvy

We are an Equal Opportunity Employer and do not discriminate against any employee or applicant for employment because of race, color, sex, age, national origin, religion, sexual orientation, gender identity, status as a veteran, disability, or any other federal, state, or local protected class.

Apply End Date: 05/27/2025

  • $120,000.00 - $190,000.00

Seniority level
  • Mid-Senior level
Employment type
  • Full-time
Job function
  • Engineering and Information Technology
Industries
  • IT Services and IT Consulting
