Enable job alerts via email!

Site Reliability Engineer

AIS (Applied Information Sciences)

Washington (District of Columbia)

Remote

USD 75,000 - 100,000

Full time

Yesterday

Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

A leading company is seeking a Site Reliability Engineer to join their DevOps team. This role focuses on ensuring the reliability and performance of cloud infrastructure. The ideal candidate will have experience in Infrastructure as Code, cloud technologies, and strong communication skills. This is a fully remote position with opportunities for growth and collaboration.

Qualifications

Minimum 3 years in Site Reliability Engineering or related field.
Experience with cloud technologies, preferably Azure.
Proficiency in scripting languages.

Responsibilities

Design, deploy, and manage scalable cloud infrastructure.
Implement monitoring and optimization for high availability.
Respond to and resolve incidents related to cloud infrastructure.

Skills

Communication Skills

Scripting

Knowledge

Education

Azure Certification

Tools

Terraform

PowerShell

Bash

Python

Prometheus

Grafana

Azure Monitor

Join to apply for the Site Reliability Engineer role at AIS (Applied Information Sciences)

1 day ago Be among the first 25 applicants

Join to apply for the Site Reliability Engineer role at AIS (Applied Information Sciences)

If you’re seeking a sense of community and the ability for growth, look no further. Since 1982, we have been 100% dedicated to our people. Our approach permits greater ownership for individuals and welcomes input into decisions for a thriving workplace and happy employees. Our people are the core reason for AIS’ success. As an employee owned company, we are looking for individuals that are passionate about finding innovative solutions, and excited about emerging technologies and capabilities.

Introduction

We are seeking a skilled Site Reliability Engineer to join our cross-functional scrum team for the DevOps – System Development Services (DO-SDS) project. The successful candidate will be responsible for ensuring the reliability, availability, and performance of the cloud infrastructure and applications. This role involves collaborating with development teams to design, build, and maintain scalable and resilient systems.

What You Will Be Doing

Infrastructure Management: Design, deploy, and manage scalable, highly available, and secure cloud infrastructure using Infrastructure as Code (IaC) principles such as Terraform and ARM templates.
Monitoring and Optimization: Implement proactive measures to monitor, analyze, and optimize the cloud environment ensuring high availability and optimal resource utilization.
Incident Management: Respond to and resolve incidents related to cloud infrastructure and applications, ensuring minimal downtime and impact on users.
Automation: Develop and maintain automation scripts and tools to streamline operations and improve efficiency.
Collaboration: Work closely with development teams to ensure seamless integration of applications and services, and provide guidance on best practices for reliability and performance.
Security and Compliance: Implement and maintain security best practices, including access controls, encryption, and identity management using Azure AD and other tools.
Documentation: Maintain comprehensive documentation of cloud infrastructure, configurations, and processes to ensure knowledge sharing and continuity.
Training and Knowledge Transfer: Provide training and knowledge transfer to junior engineers and other team members on cloud infrastructure and Site Reliability Engineering practices.

Location and Clearance Requirements

This is a remote position with occasional travel. The ability to obtain and maintain a Public Trust Clearance is required.

Required For This Opportunity

Experience: Minimum of 3 years of experience in Site Reliability Engineering or a related field.
IaC Development: Minimum of 3 years of experience developing and deploying Infrastructure as Code (IaC) using Terraform and ARM templates.
Cloud Technologies: Minimum of 3 years of experience with cloud technologies, preferably Azure.
Azure Certification: Azure Certification (e.g., Microsoft Certified Azure Administrator Associate or Azure Solutions Architect Expert).
Knowledge: Demonstrated knowledge of cloud services, including virtual machines, storage, networking, and Azure AD.
Scripting: Proficiency in scripting languages such as PowerShell, Bash, and Python.
Monitoring Tools: Hands-on experience with monitoring tools such as Prometheus, Grafana, and Azure Monitor.
Communication Skills: Exceptional verbal and written communication skills to effectively collaborate with team members and stakeholders.

Nice To Have Skills

Prior experience with the Treasury is nice to have.

Applied Information Sciences does not discriminate on the basis of race, national origin, religion, color, gender, sexual orientation, age, disability, protected veteran status, or any other basis. Employment decisions are based solely on qualifications, merit, and business needs.

Seniority level

Seniority level
Mid-Senior level

Employment type

Employment type
Full-time

Job function

Job function
Engineering and Information Technology
Industries
IT Services and IT Consulting

Referrals increase your chances of interviewing at AIS (Applied Information Sciences) by 2x

Get notified about new Site Reliability Engineer jobs in Washington, DC.

Site Reliability Engineer (SRE) - Platform Infrastructure team (100% Remote - USA)

Washington, DC $75,000.00-$100,000.00 2 months ago

Site Reliability Engineer (FULLY REMOTE)

Observability Capacity SRE Engineer (East Coast, FULLY REMOTE)

Washington, DC $125,000.00-$155,000.00 10 months ago

Observability Capacity SRE Engineer (East Coast, FULLY REMOTE)

Arlington, VA $150,000.00-$200,000.00 6 months ago

Washington, DC $65,000.00-$185,000.00 9 months ago

Arlington, VA $90,000.00-$105,000.00 1 month ago

Washington, DC $100,000.00-$125,000.00 3 months ago

Machine Learning Engineer, AI Platform (FULLY REMOTE, USA ONLY)

District of Columbia, United States $90,000.00-$145,000.00 5 months ago

Maryland, United States $90,000.00-$160,000.00 5 months ago

Security Engineer with Cloud Operations - 100% Remote

Herndon, VA $140,000.00-$160,000.00 2 months ago

We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

Get your free, confidential resume review.

or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.