Sr. Site Reliability Engineer

techolution

California (MO)

Remote

USD 120,000 - 160,000

Full time

Today

Job summary

A leading tech consulting firm is seeking a Sr AWS Site Reliability Engineer to enhance cloud infrastructure reliability and security. This remote role requires expertise in AWS, DevOps, and automation, focusing on high-availability systems and incident response.

Benefits

401(k)
Medical insurance
Vision insurance

Qualifications

  • Strong experience in AWS services like EC2, Lambda, EKS.
  • Expertise in Infrastructure as Code tools like Terraform.
  • Experience with CI/CD pipelines using Jenkins or GitHub Actions.

Responsibilities

  • Design and maintain highly available AWS infrastructure.
  • Monitor and troubleshoot system issues.
  • Develop Infrastructure as Code using Terraform.

Skills

AWS services
Infrastructure as Code
CI/CD pipelines
monitoring
scripting
problem-solving

Certifications

AWS Solution Architect Associate
AWS Solution Architect Professional
AWS DevOps Professional

Tools

Terraform
CloudFormation
Jenkins
GitHub Actions
Docker
Kubernetes
Prometheus
ELK
Datadog

Job description

6 days ago

We are seeking a highly skilled Sr AWS Site Reliability Engineer (SRE) to enhance the reliability, scalability, and security of our cloud infrastructure. The ideal candidate will be responsible for designing, implementing, and maintaining high-availability systems, automating processes, and ensuring seamless operations on AWS. This role requires expertise in DevOps, cloud automation, monitoring, and incident response.

Title: Sr AWS Site Reliability Engineer (SRE)

Location: Remote (candidates must be located in the PST time zone)

Employment Type: Full-time

Please Note: Due to the nature of the government project, U.S. citizenship is required.

Responsibilities:

  • Design and maintain highly available, scalable, and fault-tolerant AWS infrastructure to ensure system reliability and performance.
  • Proactively monitor and troubleshoot system issues, minimizing downtime and optimizing system performance.
  • Develop and maintain Infrastructure as Code (IaC) using Terraform, CloudFormation, or AWS CDK to automate deployments and infrastructure management.
  • Implement and optimize continuous integration and deployment (CI/CD) pipelines using tools like Jenkins, GitLab CI/CD, or AWS CodePipeline.
  • Ensure AWS environments meet security best practices, including IAM policies, network security configurations, and compliance requirements.
  • Set up and manage monitoring and logging solutions using tools such as Prometheus, AWS CloudWatch, ELK Stack, and Datadog.
  • Identify and address performance bottlenecks through load balancing, caching strategies, and system optimizations.
  • Work closely with developers, security teams, and product managers to enhance system architecture and operational efficiency.

Required Skills & Experience

  • Strong experience in AWS services such as EC2, Lambda, EKS, S3, SageMaker, DynamoDB, and IAM.
  • Expertise in Infrastructure as Code (IaC) tools like Terraform or CloudFormation.
  • Proficiency in CI/CD pipelines using GitHub Actions, Jenkins, or AWS CodePipeline.
  • Experience with containerization and orchestration (Docker, Kubernetes, Helm).
  • Strong knowledge of monitoring, logging, and alerting tools (CloudWatch, Prometheus, ELK, Datadog).
  • Solid Python, Bash, or Golang scripting skills for automation.
  • Experience working with ML models in production environments is a plus.
  • Familiarity with security best practices (IAM, VPC security, encryption, WAF).
  • Strong problem-solving and troubleshooting skills.

Preferred Qualifications

  • Experience with MLOps frameworks and AI model deployment.
  • Knowledge of AWS AI/ML services like SageMaker, Bedrock, or AI pipelines.
  • Hands-on experience with Kafka, Spark, or other big data technologies.

At least one of the following certifications:

  • AWS Solution Architect Associate
  • AWS Solution Architect Professional
  • AWS DevOps Professional

About Techolution:

Techolution is a next-gen consulting firm on track to become one of the most admired brands in the world for "innovation done right". Our purpose is to harness our expertise in novel technologies to deliver more profits for our enterprise clients while helping them deliver a better human experience for the communities they serve.

With that, we are now fully committed to helping our clients build the enterprise of tomorrow by making the leap from Lab Grade AI to Real World AI. Our other focus areas are Enterprise Cloud, Product Innovation (IoT, 3D printing, Robotics), and Real World AI Services (CV, LLM, CNN).

We are honored to have recently received the prestigious Inc 500 Best In Business award, a testament to our commitment to excellence. We were also named AI Solution Provider of the Year by The AI Summit 2023 and were a Platinum sponsor at the Advantage DoD 2024 Symposium. While we are big enough to be trusted by some of the greatest brands in the world, we are small enough to care about delivering meaningful ROI-generating innovation at a guaranteed price for each client that we serve.

Our thought leader, Luv Tulsidas, wrote and published a book in collaboration with Forbes, “Failing Fast? Secrets to succeed fast with AI”. For more details on the content, see https://www.luvtulsidas.com/

Let's explore further!

Uncover our unique AI accelerators with us:

1. Enterprise LLM Studio: Our no-code DIY AI studio for enterprises. Choose an LLM, connect it to your data, and create an expert-level agent in 20 minutes.

2. AppMod.AI: Modernizes ancient tech stacks quickly, achieving over 80% autonomy for major brands!

3. ComputerVision.AI: Offers customizable Computer Vision and Audio AI models, plus DIY tools and a Real-Time Co-Pilot for human-AI collaboration!

4. Robotics and Edge Device Fabrication: Provides comprehensive robotics, hardware fabrication, and AI-integrated edge design services.

5. RLEF AI Platform: Our proven Reinforcement Learning with Expert Feedback (RLEF) approach bridges Lab-Grade AI to Real-World AI.

6. AI Center of Excellence: Establishes an AI Center of Excellence to maximize AI potential and ROI.

7. FaceOpen: AI-powered user identification system using image recognition and deep neural networks, eliminating the need for keys, badges, or fingerprint scanners!

  • Computer Vision demo at The AI Summit New York 2023
  • Life at Techolution
  • GoogleNext 2023
  • Ai4 - Artificial Intelligence Conferences 2023
  • WaWa - Solving Food Wastage
  • Saving lives - Brooklyn Hospital
  • Techolution featured on Worldwide Business with KathyIreland
  • Techolution presented by ION World’s Greatest

Visit us at www.techolution.com to learn more about our revolutionary core practices and how we enrich the human experience with technology.

Seniority level: Mid-Senior level
Employment type: Full-time
Job function: Information Technology and Engineering
Industries: IT Services and IT Consulting
