
Ativa os alertas de emprego por e-mail!
Cria um currículo personalizado em poucos minutos
Consegue uma entrevista e ganha mais. Sabe mais
A global technology firm is seeking a Site Reliability Engineer (SRE) to contribute to building resilient and automated systems. You will work closely with various teams to maintain platform performance, reliability, and security within a distributed cloud-based environment on Microsoft Azure. This role demands extensive experience in SRE practices, cloud infrastructure, and automation tools, ensuring high availability and scalability in a 24/7 operational environment.
We at Softinity are looking for a Site Reliability Engineer (SRE) – This is a dynamic and hands-on role within a global, collaborative SRE environment. The SRE Technical Member will contribute to building resilient systems, automating operations, and ensuring the platform meets high standards for performance, reliability, and security.
You will be working closely with Central SRE, DevOps, InfoSec, and Agile development teams to maintain platform stability, scalability, and performance.
The Platform is a distributed, cloud-based system serving hundreds of geographically dispersed clients. It operates on Microsoft Azure using a microservices architecture, combining open-source, licensed, and internally developed tools for provisioning, deployment, monitoring, and logging.
SREs own the entire production stack — from application functionality to infrastructure resilience — ensuring availability, reliability, and scalability in a 24 / 7 operational environment.
This role requires problem-solving through data, collaboration, and technical expertise, maintaining a balance between engineering innovation and practical delivery.
Collaborate with Central SRE, DevOps, and InfoSec teams on new projects, platform builds, and deployments.
Contribute to the design, implementation, and operation of large-scale, Azure-based platforms.
Apply industry best practices in monitoring, alerting, reporting, and cloud architecture.
Participate in infrastructure, application, and security planning, focusing on scalability, redundancy, and data preservation.
Support high-availability topologies with development teams.
Produce documentation and weekly operational status reports, detailing project progress and key metrics.
Provide engineering and support for technical infrastructure, cloud, databases, and application performance.
Manage incident response, change management, and user permissions following SRE best practices (Google SRE model).
Maintain close collaboration between Application, Central SRE, DevOps, InfoSec, and business units.
Assist in configuring and onboarding new applications into the Azure DevOps (ADO) platform.
Strong understanding of SRE fundamentals : monitoring, alerting, reporting, performance, availability, and incident response.
Hands‑on experience with CI / CD tools (Git, Azure Pipelines, Ansible, etc.).
Deep knowledge of Azure Web Services — installation, configuration, and management.
Experience administering Microsoft applications (.NET, C#, Angular) with focus on automation, optimization, and security.
Proficiency in Cosmos DB and MS SQL operational tasks.
Excellent troubleshooting, root‑cause analysis, and problem‑solving skills.
Experience with disaster recovery, scalability testing, and capacity planning.
Expertise with cloud deployment and automation tools (Git, Azure DevOps, Ansible, etc.).
Ability to automate routine deployment, monitoring, and administrative tasks.
Write and maintain documentation and custom tools for monitoring and performance optimization.
Proficiency in Shell scripting and API troubleshooting for production support.
Experience designing, authoring, and maintaining .NET / C# code.
Capability to deliver hotfixes and operational patches (.NET & Angular).
Working knowledge of automation scripting languages for operational tools development.
Bachelor's degree in a technical discipline (Computer Science, Engineering, or related field).
5+ years of industry experience in SRE, DevOps, or related technical operations roles.
Proven experience in cloud infrastructure, automation, and application reliability engineering within large-scale, enterprise environments.
We are passionate about top quality talent and giving our team members the tools they need for them to keep on growing and learning. The sky is truly the limit and we want you to feel challenged and motivated in every single project that you're a part of all while working with cutting edge technologies and amazing clients.
What to expect?
Coursera credentials
Remote work
Softinity is an equal‑opportunity employer. All qualified applicants are considered without regard to gender, identity, or personal background.