
Ativa os alertas de emprego por e-mail!
Cria um currículo personalizado em poucos minutos
Consegue uma entrevista e ganha mais. Sabe mais
A leading independent software provider in São Paulo is seeking a Site Reliability Engineer to manage cloud deployments on platforms like Azure, AWS, and GCP. You will ensure service availability, optimize architectures, and automate CI/CD pipelines. Candidates should have expertise in cloud technologies, networking, and scripting languages. This role involves collaborative work with teams to enhance product quality and requires strong problem-solving skills. Competitive compensation and a dynamic work environment await the right candidate.
As a Site Reliability Engineer, you will play a key role in supporting existing customers with their managed or private cloud deployments, as well as in launching new deployments on major cloud platforms such as Azure, AWS, and GCP. Your mission will include ensuring the smooth operation, scalability, and security of cloud services, as well as automating processes to increase both efficiency and reliability.
1. Deployment Setup and Management:
Lead the design and implementation of new cloud deployments, tailoring solutions to meet stakeholder requirements on platforms like Azure, AWS, GCP, and Kubernetes.
Optimize cloud architectures for scalability and cost-effectiveness, adhering to best practices for networking, security, and access controls.
Gain and maintain deep knowledge of cloud infrastructure providers to create robust solutions.
2. Automation and CI/CD:
Craft and manage automation scripts and infrastructure as code (IaC) with Terraform, Ansible, or CloudFormation.
Deploy CI/CD pipelines to streamline software delivery, testing, and deployment processes, ensuring efficient version control and configuration management.
3. Managed Cloud Support:
Ensure the availability of the services by configuring system monitors and alerts and attending to critical alerts in a timely manner.
Offer continuous support and maintenance for existing deployments, monitoring system performance and swiftly resolving issues to maintain high availability and reliability.
Implement strategies for performance optimization and failure prevention, conducting thorough root cause analyses to avoid future issues.
4. Monitoring and Security:
Establish comprehensive monitoring and alerting systems to oversee customer deployments, setting thresholds for incident response.
Conduct regular security assessments and stay abreast of the latest threats and trends to fortify cloud environments against risks.
5. Collaboration and Knowledge Sharing:
Foster a collaborative environment with product developers, operations, and QA teams to enhance workflows and product quality.
Share knowledge and best practices, contributing to the team’s collective expertise through documentation, training, and mentorship.
Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent experience.
Expertise in cloud platforms such as Azure, AWS and GCP.
Expertise in Linux, virtualization and containerization technologies such as Docker and Kubernetes.
A solid understanding of networking, security principles, and compliance frameworks.
Proficiency in IaC tools (Terraform, CloudFormation), configuration management (Puppet, Chef, Helm), and scripting languages (Python, Bash, PowerShell).
Experience with CI/CD tools (Github Actions, Jenkins) and monitoring/logging tools (Prometheus, ELK stack, Splunk).
Exceptional problem-solving, analytical, and troubleshooting skills, coupled with a proactive, customer-centric mindset.
Strong communication skills and the ability to collaborate effectively in a team environment.
Founded in 2005, our client is the largest independent software provider offering open source API management, integration, and identity and access management (IAM) to thousands of companies in more than 90 countries. The company\'s products and platforms enable organizations to unlock the full potential of artificial intelligence and APIs to securely deliver the next generation of AI-powered digital services and applications.
Our open source, AI-powered, API-centric approach frees developers and architects from single-vendor lock-in and enables rapid digital product creation.
Recognized as a leader by industry analysts, the company has more than 800 employees worldwide and offices in Australia, Brazil, Germany, India, Sri Lanka, the United Arab Emirates, the United Kingdom, and the United States, with more than $100 million in annual recurring revenue.