Lead Site Reliability Engineer

General Dynamics Information Technology

United States

Remote

USD 144,000 - 196,000

Full time

Yesterday
Be an early applicant

Job summary

An established industry player is looking for a skilled Site Reliability Engineer to join their cloud team. This role involves enhancing the reliability of core cloud infrastructure and collaborating with government stakeholders. You'll develop monitoring strategies, assess infrastructure performance, and engage in continuous improvement activities. With a focus on AWS, this position offers the chance to work with cutting-edge technology in a remote setting, while also providing opportunities for professional growth and a comprehensive benefits package. If you have a passion for cloud solutions and a strong technical background, this role is perfect for you.

Benefits

Health Insurance
Retirement Plan
Paid Time Off
Professional Growth Opportunities

Qualifications

  • 10+ years in AWS infrastructure design and deployment.
  • 3+ years in a complex SRE role with strong analytical skills.

Responsibilities

  • Enhance reliability of AWS infrastructure and oversee core accounts.
  • Collaborate with stakeholders on logging and monitoring strategies.

Skills

AWS DevOps
Cloud Infrastructure
Cloud Service Automation
Cloud Testing
IT Monitoring

Tools

IaC (CDK or CloudFormation)
CloudWatch
Splunk
Instana

Job description

Type of Requisition: Regular

Clearance Level Must Currently Possess: None

Clearance Level Must Be Able to Obtain: None

Public Trust/Other Required: Other

Job Family: Cloud

Job Qualifications:

  • Skills: AWS DevOps, Cloud Infrastructure, Cloud Service Automation, Cloud Testing, IT Monitoring
  • Certifications: None
  • Experience: 10+ years of related experience
  • US Citizenship Required: No

Job Description:

GDIT is seeking a Lead Site Reliability Engineer (SRE) to elevate our cloud team. You will collaborate with government stakeholders and team members to enhance the reliability of the agency's core cloud infrastructure. As an SRE, you will act as an Account Manager and developer for core AWS accounts, overseeing services within the agency’s infrastructure AWS accounts.

HOW A SITE RELIABILITY ENGINEER WILL MAKE AN IMPACT

  • Develop a deep understanding of system interoperability within the infrastructure, including dependencies.
  • Review all AWS infrastructure deployments to assess impacts and validate test processes, including Change Management activities.
  • Ensure proper configuration of monitoring, logging, and alerting for services in core infrastructure accounts, developing new solutions as needed.
  • Create metrics to assess infrastructure performance and participate in incident response activities, including post-mortem analyses.
  • Collaborate with stakeholders to develop and maintain logging and monitoring strategies, coding as necessary.
  • Engage in continuous improvement activities, technical debt analysis, and contribute to reliability standards.
  • Work with DevOps engineers to improve deployment processes and automate testing.
  • Audit resources, identify improvements, and troubleshoot integration issues.
  • Update infrastructure codebase with necessary changes.

WHAT YOU’LL NEED TO SUCCEED:

  • 10+ years in AWS infrastructure design and deployment; 3+ years in a complex SRE role.
  • Technical skills: IaC (CDK or CloudFormation), logging and monitoring tools (CloudWatch, Splunk, Instana).
  • Strong analytical and communication skills, and the ability to work with government stakeholders.
  • Preferred: AWS Solutions Architect Professional or DevOps Engineer Professional Certification.

Location: Remote with on-site meetings in the DC Metro area.

Salary range: $144,500 - $195,500, based on experience and location.

Additional details: 40 hours/week, less than 10% travel, telecommuting options available, work location in VA.

GDIT offers comprehensive benefits including health, retirement, paid time off, and professional growth opportunities. Join us to work on cutting-edge technology and make an impact.

Similar jobs

Principal Site Reliability Engineer

Lumen Technologies

Remote

USD 149,000 - 199,000

Today
Be an early applicant

Principal Site Reliability Engineer

Lumen Argentina

Aurora

Remote

USD 156,000 - 209,000

2 days ago
Be an early applicant

Lead Site Reliability Engineer/Architect (Remote)

Cognizant

Riverwoods

Remote

USD 120,000 - 162,000

4 days ago
Be an early applicant

Lead Site Reliability Engineer (AZURE) - Empower Product Group

Hitachi Solutions

Greenville

Remote

USD 142,000 - 199,000

6 days ago
Be an early applicant

Lead Site Reliability Engineer/Architect (Remote)

Cognizant North America

Riverwoods

Remote

USD 120,000 - 162,000

7 days ago
Be an early applicant

Lead Site Reliability Engineer - Catalyst

IO Global

Remote

USD 150,000 - 175,000

8 days ago

Principal Site Reliability Engineer

Atlassian

Aurora

Remote

USD 170,000 - 275,000

25 days ago

Lead Site Reliability Engineer - Cloud Platforms

Jobot

Kalamazoo

Remote

USD 160,000 - 200,000

Yesterday
Be an early applicant

Lead Site Reliability Engineer

Corelight

San Francisco

Remote

USD 184,000 - 229,000

6 days ago
Be an early applicant