Enable job alerts via email!

Site Reliability Engineer

AIS (Applied Information Sciences)

Washington (District of Columbia)

Remote

USD 75,000 - 100,000

Full time

Yesterday
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

A leading company is seeking a Site Reliability Engineer to join their DevOps team. This role focuses on ensuring the reliability and performance of cloud infrastructure. The ideal candidate will have experience in Infrastructure as Code, cloud technologies, and strong communication skills. This is a fully remote position with opportunities for growth and collaboration.

Qualifications

  • Minimum 3 years in Site Reliability Engineering or related field.
  • Experience with cloud technologies, preferably Azure.
  • Proficiency in scripting languages.

Responsibilities

  • Design, deploy, and manage scalable cloud infrastructure.
  • Implement monitoring and optimization for high availability.
  • Respond to and resolve incidents related to cloud infrastructure.

Skills

Communication Skills
Scripting
Knowledge

Education

Azure Certification

Tools

Terraform
PowerShell
Bash
Python
Prometheus
Grafana
Azure Monitor

Job description

Join to apply for the Site Reliability Engineer role at AIS (Applied Information Sciences)

1 day ago Be among the first 25 applicants

Join to apply for the Site Reliability Engineer role at AIS (Applied Information Sciences)

If you’re seeking a sense of community and the ability for growth, look no further. Since 1982, we have been 100% dedicated to our people. Our approach permits greater ownership for individuals and welcomes input into decisions for a thriving workplace and happy employees. Our people are the core reason for AIS’ success. As an employee owned company, we are looking for individuals that are passionate about finding innovative solutions, and excited about emerging technologies and capabilities.

Introduction

We are seeking a skilled Site Reliability Engineer to join our cross-functional scrum team for the DevOps – System Development Services (DO-SDS) project. The successful candidate will be responsible for ensuring the reliability, availability, and performance of the cloud infrastructure and applications. This role involves collaborating with development teams to design, build, and maintain scalable and resilient systems.

What You Will Be Doing

  • Infrastructure Management: Design, deploy, and manage scalable, highly available, and secure cloud infrastructure using Infrastructure as Code (IaC) principles such as Terraform and ARM templates.
  • Monitoring and Optimization: Implement proactive measures to monitor, analyze, and optimize the cloud environment ensuring high availability and optimal resource utilization.
  • Incident Management: Respond to and resolve incidents related to cloud infrastructure and applications, ensuring minimal downtime and impact on users.
  • Automation: Develop and maintain automation scripts and tools to streamline operations and improve efficiency.
  • Collaboration: Work closely with development teams to ensure seamless integration of applications and services, and provide guidance on best practices for reliability and performance.
  • Security and Compliance: Implement and maintain security best practices, including access controls, encryption, and identity management using Azure AD and other tools.
  • Documentation: Maintain comprehensive documentation of cloud infrastructure, configurations, and processes to ensure knowledge sharing and continuity.
  • Training and Knowledge Transfer: Provide training and knowledge transfer to junior engineers and other team members on cloud infrastructure and Site Reliability Engineering practices.

Location and Clearance Requirements

This is a remote position with occasional travel. The ability to obtain and maintain a Public Trust Clearance is required.

Required For This Opportunity

  • Experience: Minimum of 3 years of experience in Site Reliability Engineering or a related field.
  • IaC Development: Minimum of 3 years of experience developing and deploying Infrastructure as Code (IaC) using Terraform and ARM templates.
  • Cloud Technologies: Minimum of 3 years of experience with cloud technologies, preferably Azure.
  • Azure Certification: Azure Certification (e.g., Microsoft Certified Azure Administrator Associate or Azure Solutions Architect Expert).
  • Knowledge: Demonstrated knowledge of cloud services, including virtual machines, storage, networking, and Azure AD.
  • Scripting: Proficiency in scripting languages such as PowerShell, Bash, and Python.
  • Monitoring Tools: Hands-on experience with monitoring tools such as Prometheus, Grafana, and Azure Monitor.
  • Communication Skills: Exceptional verbal and written communication skills to effectively collaborate with team members and stakeholders.

Nice To Have Skills

  • Prior experience with the Treasury is nice to have.

Applied Information Sciences does not discriminate on the basis of race, national origin, religion, color, gender, sexual orientation, age, disability, protected veteran status, or any other basis. Employment decisions are based solely on qualifications, merit, and business needs.

Seniority level
  • Seniority level
    Mid-Senior level
Employment type
  • Employment type
    Full-time
Job function
  • Job function
    Engineering and Information Technology
  • Industries
    IT Services and IT Consulting

Referrals increase your chances of interviewing at AIS (Applied Information Sciences) by 2x

Get notified about new Site Reliability Engineer jobs in Washington, DC.

Site Reliability Engineer (SRE) - Platform Infrastructure team (100% Remote - USA)

Washington, DC $75,000.00-$100,000.00 2 months ago

Site Reliability Engineer (FULLY REMOTE)
Observability Capacity SRE Engineer (East Coast, FULLY REMOTE)
Observability Capacity SRE Engineer (East Coast, FULLY REMOTE)

Washington, DC $125,000.00-$155,000.00 10 months ago

Observability Capacity SRE Engineer (East Coast, FULLY REMOTE)

Arlington, VA $150,000.00-$200,000.00 6 months ago

Washington, DC $65,000.00-$185,000.00 9 months ago

Arlington, VA $90,000.00-$105,000.00 1 month ago

Washington, DC $100,000.00-$125,000.00 3 months ago

Machine Learning Engineer, AI Platform (FULLY REMOTE, USA ONLY)

District of Columbia, United States $90,000.00-$145,000.00 5 months ago

Maryland, United States $90,000.00-$160,000.00 5 months ago

Security Engineer with Cloud Operations - 100% Remote

Herndon, VA $140,000.00-$160,000.00 2 months ago

We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Site Reliability Engineer

Applied Information Sciences, Inc.

Virginia

Remote

USD 90,000 - 120,000

Today
Be an early applicant

Site Reliability Engineer

10Pearls

Tysons

Remote

USD 75,000 - 100,000

2 days ago
Be an early applicant

Site Reliability Engineer

Element Solutions

Washington

Remote

USD 75,000 - 100,000

12 days ago

Site Reliability Engineer (FULLY REMOTE)

Splunk

North Carolina

Remote

USD 90,000 - 120,000

Today
Be an early applicant

Site Reliability Engineer

IBM

Jersey City

Remote

USD 90,000 - 140,000

14 days ago

Site Reliability Engineer

IBM Computing

Jersey City

Remote

USD 90,000 - 150,000

14 days ago

Site Reliability Engineer

ZipRecruiter

Great Falls Crossing

Remote

USD 80,000 - 120,000

11 days ago

Site Reliability Engineer

Leidos

Remote

USD 85,000 - 154,000

Yesterday
Be an early applicant

Site Reliability Engineer (FULLY REMOTE)

Splunk

Boulder

Remote

USD 90,000 - 130,000

Today
Be an early applicant