Aktiviere Job-Benachrichtigungen per E-Mail!
A global cloud consulting company is hiring a remote Site Reliability Engineer. This role is central to enhancing platform reliability and system observability, requiring expertise in AWS and programming skills in Python and Ruby. You will drive improvements across a distributed environment, ensuring proactive reliability practices. Join us to make a real impact in cloud technology.
Aug 24, 2025 - Virtasant is hiring a remote Site Reliability Engineer. Location: USA.
Remote – North/South America (US Time Zone Overlap Required)
We’re looking for a Site Reliability Engineer to join a high-impact cloud infrastructure team at one of Virtasant’s key technology partners. You'll play a critical role in improving system observability, ensuring platform reliability, and embedding proactive engineering practices across a globally distributed environment.
This is a hands-on technical role ideal for someone who brings a developer’s mindset to SRE work. You’ll be expected to anticipate problems before they happen, automate smart solutions, and elevate how reliability is measured, built, and maintained.
Drive the creation and evolution of observability systems — including dashboards, logging, alerting, and instrumentation.
Identify trends, anomalies, and early warning signs through data analysis.
Work with engineers to drive the adoption of observability best practices across squads.
Surface, propose, and implement proactive reliability improvements across AWS environments.
Contribute to build, test, and deploy workflows (CI/CD), with a strong emphasis on automation.
Collaborate across teams using agile ceremonies, async-first workflows, and direct feedback loops.
Deep knowledge of observability tooling, preferably with Datadog
Hands-on SRE experience within AWS, including Lambda, containers, and IAM
Strong programming skills in Python and Ruby
Experience with Terraform and infrastructure as code (IaC) practices
Familiarity with incident management, on-call rotations, and SLAs
Ability to identify patterns and risks from telemetry and act on them proactively
Previous experience as a software developer or DevOps engineer
Knowledge of reliability strategies for containerized workloads
Comfortable contributing to CI pipelines and deployment strategies
Experience working in environments with limited QA/BA handoffs
Languages: Python, Ruby
Cloud: AWS (Lambda, ECS, IAM)
IaC: Terraform
Observability: Datadog
Workflow: Agile (Scrum), Jira, Git, CI/CD pipelines
Observability-first culture – You won’t just respond to alerts; you’ll design the systems that prevent them.
Hands-on impact – You’ll drive real improvements that increase uptime, performance, and engineering confidence.
Autonomy & ownership – Work independently while contributing to high-performing global teams.
Real scale challenges – Help support large-scale, distributed systems with meaningful end-user impact.
Virtasant is a global cloud consulting and technology company operating across 130+ countries. We deliver transformative solutions across cloud cost optimization, software engineering, technology operations, and AI. Our projects are meaningful, our teams are globally distributed, and our culture is built on autonomy, trust, and technical excellence.