Site Reliability Engineer

ZipRecruiter

Alpharetta (GA)

Remote

USD 100,000 - 140,000

Full time

2 days ago

Job summary

Join a leading company as a Site Reliability Engineer, playing a critical role in ensuring the reliability and scalability of cloud-based services. You will leverage your expertise to enhance security and performance, manage AWS infrastructures, and lead migrations while guiding teams in best practices for DevOps tooling and cloud operations. Ideal candidates will have a solid background in cybersecurity, CI/CD, and containerization technologies.

Qualifications

  • 3-5 years of hands-on experience in the cybersecurity field.
  • Experience running and supporting containerized workloads in production environments.
  • Familiarity with observability and logging tools to ensure system performance.

Responsibilities

  • Lead the migration of EC2 workloads to ECS and develop DevOps tooling.
  • Design proactive monitoring solutions using Prometheus and Grafana.
  • Uphold SLAs and SLOs by applying SRE best practices.

Skills

CI/CD
Networking principles
Cloud infrastructure management
DevOps tooling
Containerization
Monitoring tools
Infrastructure as Code

Education

Bachelor's degree in computer science
Equivalent work experience in cybersecurity

Tools

GitHub Actions
AWS
Terraform
ECS
PostgreSQL
Prometheus
Grafana

Job description

Site Reliability Engineer

As a Site Reliability Engineer at DefenseStorm, you will play a crucial role in ensuring the reliability, scalability, and performance of our cloud-based services. GRID is a high-throughput, data-intensive application that currently handles 250k events/sec. You will drive best practices and contribute to both the design and implementation of robust cloud infrastructure that can scale rapidly to support DefenseStorm's growing customer base.

Location
Atlanta, GA
Remote

Job Duties and Responsibilities

  • Lead the migration of EC2 workloads to ECS and develop DevOps tooling to empower development teams to build and manage containerized applications.
  • Advance zero trust security initiatives by implementing a service mesh architecture with technologies such as Istio.
  • Enhance the security, scalability, and reliability of AWS cloud infrastructure through continuous improvement and innovation.
  • Design and implement proactive monitoring and alerting solutions using tools like Prometheus, Grafana, and OpsGenie, leveraging data-driven insights to optimize uptime and mitigate operational risks.
  • Uphold SLAs and SLOs by applying SRE best practices, including incident response, post-mortem analysis, and the creation of operational playbooks.
  • Build, manage, and scale cloud infrastructure using Infrastructure as Code (IaC) tools such as Terraform.
  • Support SOC 2 and ISO compliance efforts by championing security best practices, streamlining evidence collection, and introducing automation to improve audit processes.
  • Other duties as assigned by management

Required Education and Experience

  • Hands-on experience building and maintaining CI/CD pipelines using tools such as GitHub Actions.
  • Strong understanding of networking principles and their application in cloud and containerized environments.
  • Proven experience designing, building, and managing cloud infrastructure in AWS.
  • Expertise with Infrastructure as Code (IaC) and deployment automation tools to streamline environment provisioning and management.
  • Experience running and supporting containerized workloads in production environments.
  • Familiarity with observability, monitoring, logging, and tracing tools to ensure system performance, reliability, and visibility.
  • Experience using AWS, ECS, Elasticsearch, PostgreSQL, Prometheus, Grafana, GitHub Actions, and Terraform.

Education and Experience

  • Bachelor's degree in computer science or equivalent work experience
  • 3-5 years of hands-on experience in the cybersecurity field

DefenseStorm provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state, or local laws.
