Enable job alerts via email!

Site Reliability Engineer

Applied Information Sciences, Inc.

Odessa (TX)

Remote

USD 90,000 - 120,000

Full time

4 days ago
Be an early applicant

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

A leading company is seeking a skilled Site Reliability Engineer to join their DevOps team. This remote position involves managing cloud infrastructure, ensuring reliability, and collaborating with development teams. Candidates should have experience in IaC, cloud technologies, and strong communication skills. The role offers opportunities for growth and requires a Public Trust Clearance.

Qualifications

  • Minimum of 3 years of experience in Site Reliability Engineering or related field.
  • Proficiency in scripting languages such as PowerShell, Bash, and Python.
  • Experience developing and deploying Infrastructure as Code (IaC) using Terraform.

Responsibilities

  • Design, deploy, and manage scalable cloud infrastructure using IaC principles.
  • Implement proactive measures to monitor and optimize the cloud environment.
  • Respond to and resolve incidents related to cloud infrastructure.

Skills

Communication Skills
Scripting
Knowledge of Cloud Services

Education

Azure Certification

Tools

Terraform
Azure AD
Prometheus
Grafana
Azure Monitor

Job description

If you're seeking a sense of community and the ability for growth, look no further. Since 1982, we have been 100% dedicated to our people. Our approach permits greater ownership for individuals and welcomes input into decisions for a thriving workplace and happy employees. Our people are the core reason for AIS' success. As an employee owned company, we are looking for individuals that are passionate about finding innovative solutions, and excited about emerging technologies and capabilities.

Introduction

We are seeking a skilled Site Reliability Engineer to join our cross-functional scrum team for the DevOps - System Development Services (DO-SDS) project. The successful candidate will be responsible for ensuring the reliability, availability, and performance of the cloud infrastructure and applications. This role involves collaborating with development teams to design, build, and maintain scalable and resilient systems.

What you will be doing

  • Infrastructure Management: Design, deploy, and manage scalable, highly available, and secure cloud infrastructure using Infrastructure as Code (IaC) principles such as Terraform and ARM templates.

  • Monitoring and Optimization: Implement proactive measures to monitor, analyze, and optimize the cloud environment ensuring high availability and optimal resource utilization.

  • Incident Management: Respond to and resolve incidents related to cloud infrastructure and applications, ensuring minimal downtime and impact on users.

  • Automation: Develop and maintain automation scripts and tools to streamline operations and improve efficiency.

  • Collaboration: Work closely with development teams to ensure seamless integration of applications and services, and provide guidance on best practices for reliability and performance.

  • Security and Compliance: Implement and maintain security best practices, including access controls, encryption, and identity management using Azure AD and other tools.

  • Documentation: Maintain comprehensive documentation of cloud infrastructure, configurations, and processes to ensure knowledge sharing and continuity.

  • Training and Knowledge Transfer: Provide training and knowledge transfer to junior engineers and other team members on cloud infrastructure and Site Reliability Engineering practices.

Location and Clearance Requirements

This is a remote position with occasional travel. The ability to obtain and maintain a Public Trust Clearance is required.

Required for this Opportunity

  • Experience: Minimum of 3 years of experience in Site Reliability Engineering or a related field.

  • IaC Development: Minimum of 3 years of experience developing and deploying Infrastructure as Code (IaC) using Terraform and ARM templates.

  • Cloud Technologies: Minimum of 3 years of experience with cloud technologies, preferably Azure.

  • Azure Certification: Azure Certification (e.g., Microsoft Certified Azure Administrator Associate or Azure Solutions Architect Expert).

  • Knowledge: Demonstrated knowledge of cloud services, including virtual machines, storage, networking, and Azure AD.

  • Scripting: Proficiency in scripting languages such as PowerShell, Bash, and Python.

  • Monitoring Tools: Hands-on experience with monitoring tools such as Prometheus, Grafana, and Azure Monitor.

  • Communication Skills: Exceptional verbal and written communication skills to effectively collaborate with team members and stakeholders.

Nice To Have Skills

  • Prior experience with the Treasury is nice to have.

Applied Information Sciences does not discriminate on the basis of race, national origin, religion, color, gender, sexual orientation, age, disability, protected veteran status, or any other basis. Employment decisions are based solely on qualifications, merit, and business needs.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.

Similar jobs

Lead Site Reliability Engineer - Remote - 2290929

Primary Care Plus

Minnetonka

Remote

USD 106,000 - 195,000

2 days ago
Be an early applicant

Site Reliability Engineer

McKesson Pharmacy Automation

Columbus

Remote

USD 88,000 - 148,000

2 days ago
Be an early applicant

Site Reliability Engineer

McKesson

Columbus

Remote

USD 88,000 - 148,000

Yesterday
Be an early applicant

Lead Site Reliability Engineer (Remote -CST)

Cognizant

Juneau

Remote

USD 81,000 - 142,000

Yesterday
Be an early applicant

Site Reliability Engineer (FULLY REMOTE)

Splunk

Georgia

Remote

USD 90,000 - 130,000

2 days ago
Be an early applicant

Senior Site Reliability Engineer - Data (REMOTE)

Discogs

Remote

USD 80,000 - 100,000

Yesterday
Be an early applicant

Site Reliability Engineer (FULLY REMOTE)

Cisco

Oregon

Remote

USD 90,000 - 130,000

Yesterday
Be an early applicant

Site Reliability Engineer (FULLY REMOTE)

Cisco

Georgia

Remote

USD 90,000 - 130,000

Yesterday
Be an early applicant

Site Reliability Engineer (FULLY REMOTE)

Cisco

Town of Texas

Remote

USD 90,000 - 130,000

Yesterday
Be an early applicant