Softensity Inc
ADV Space
Xp Inc.
Platform Builders
Velozient
Connect with headhunters to apply for similar jobsVelozient
Radity
Niteo Technologies
Softensity Inc
Smart Services Tecnologia Ltda
A technology solutions company in Brazil is seeking a Site Reliability Engineer (SRE) to contribute to building resilient systems and ensuring platform performance, reliability, and security. You will work closely with Central SRE, DevOps, and Agile teams to maintain stability and scalability of a distributed, cloud-based platform on Azure. Candidates should have a Bachelor's degree in a technical discipline and 5+ years of experience in SRE or related roles. The company offers remote work and learning opportunities.
We at Softinity are looking for a Site Reliability Engineer (SRE) – This is a dynamic and hands-on role within a global, collaborative SRE environment. The SRE Technical Member will contribute to building resilient systems, automating operations, and ensuring the platform meets high standards for performance, reliability, and security.
You will be working closely with Central SRE, DevOps, InfoSec, and Agile development teams to maintain platform stability, scalability, and performance.
The Platform is a distributed, cloud-based system serving hundreds of geographically dispersed clients. It operates on Microsoft Azure using a microservices architecture, combining open-source, licensed, and internally developed tools for provisioning, deployment, monitoring, and logging.
SREs own the entire production stack — from application functionality to infrastructure resilience — ensuring availability, reliability, and scalability in a 24 / 7 operational environment.
This role requires problem-solving through data, collaboration, and technical expertise, maintaining a balance between engineering innovation and practical delivery.
Collaborate with Central SRE, DevOps, and InfoSec teams on new projects, platform builds, and deployments.
Contribute to the design, implementation, and operation of large-scale, Azure-based platforms.
Apply industry best practices in monitoring, alerting, reporting, and cloud architecture.
Participate in infrastructure, application, and security planning, focusing on scalability, redundancy, and data preservation.
Support high-availability topologies with development teams.
Produce documentation and weekly operational status reports, detailing project progress and key metrics.
Provide engineering and support for technical infrastructure, cloud, databases, and application performance.
Manage incident response, change management, and user permissions following SRE best practices (Google SRE model).
Maintain close collaboration between Application, Central SRE, DevOps, InfoSec, and business units.
Assist in configuring and onboarding new applications into the Azure DevOps (ADO) platform.
Strong understanding of SRE fundamentals : monitoring, alerting, reporting, performance, availability, and incident response.
Hands‑on experience with CI / CD tools (Git, Azure Pipelines, Ansible, etc.).
Deep knowledge of Azure Web Services — installation, configuration, and management.
Experience administering Microsoft applications (.NET, C#, Angular) with focus on automation, optimization, and security.
Proficiency in Cosmos DB and MS SQL operational tasks.
Excellent troubleshooting, root‑cause analysis, and problem‑solving skills.
Experience with disaster recovery, scalability testing, and capacity planning.
Expertise with cloud deployment and automation tools (Git, Azure DevOps, Ansible, etc.).
Ability to automate routine deployment, monitoring, and administrative tasks.
Write and maintain documentation and custom tools for monitoring and performance optimization.
Proficiency in Shell scripting and API troubleshooting for production support.
Experience designing, authoring, and maintaining .NET / C# code.
Capability to deliver hotfixes and operational patches (.NET & Angular).
Working knowledge of automation scripting languages for operational tools development.
Bachelor's degree in a technical discipline (Computer Science, Engineering, or related field).
5+ years of industry experience in SRE, DevOps, or related technical operations roles.
Proven experience in cloud infrastructure, automation, and application reliability engineering within large-scale, enterprise environments.
We are passionate about top quality talent and giving our team members the tools they need for them to keep on growing and learning. The sky is truly the limit and we want you to feel challenged and motivated in every single project that you're a part of all while working with cutting edge technologies and amazing clients.
What to expect?
Coursera credentials
Remote work
Softinity is an equal‑opportunity employer. All qualified applicants are considered without regard to gender, identity, or personal background.
* The salary benchmark is based on the target salaries of market leaders in their relevant sectors. It is intended to serve as a guide to help Premium Members assess open positions and to help in salary negotiations. The salary benchmark is not provided directly by the company, which could be significantly higher or lower.