Remote Site Reliability Engineer
Ontem
At INDI, we're passionate about empowering individuals and businesses worldwide. Our cutting-edge recruiters connect leading companies with top talent, fostering a dynamic environment where innovation thrives. Join us in shaping the future of work.
Overview
We are looking for a Site Reliability Engineer to build and maintain highly reliable, scalable, and secure OpenShift/Kubernetes clusters. We will need you to approach the problem of building and maintaining production systems from a software engineering perspective with a focus on automation, and reliability.
Key responsibilities
- Build and automate and maintain OpenShift/Kubernetes clusters.
- Create and enhance tools to make operational workflows more automated.
- Configure and maintain additional required supporting infrastructure applications.
- Monitor, respond to, and resolve Cluster and infrastructure service issues.
- Handle infrastructure and services on prem and in AWS.
- Diagnose and resolve problems in OpenShift and/or Kubernetes clusters.
- Implement metrics to measure service performance and health.
Requirements
- 5+ years of experience as a Site Reliability Engineer.
- Deep experience with Linux Administration.
- Automation experience with Python, Bash, Salt, or equivalent.
- Knowledge installing, managing, maintaining, and troubleshooting OpenShift/Kubernetes clusters.
- Advanced English level.
• Flexibility: Choose where and how you work for enhanced creativity and innovation.
• Tailored Compensation: Personalize your earnings to suit your financial goals.
• Tech-Driven Tools: Access cutting-edge resources for seamless collaboration and productivity.
• Autonomous Workflow: Take control of your schedule to achieve work-life balance.
• Well-being: Enjoy generous leave policies for rest and rejuvenation.
• Diversity & Inclusion: Thrive in a diverse and inclusive environment.
• Collaboration: Engage with industry leaders for collective growth.
• Development: Access mentorship and growth opportunities for continuous advancement.
If you are interested in being part of a team composed of the best professionals and working 100% goal-oriented in an innovative environment, but with the structure and resources of a multinational market leader, do not hesitate to apply!
Ontem
Context
We are a fast-growing startup that has experienced tremendous growth over the past months, achieving 8x revenue growth in the last 18 months and a 10x usage increase over the same period. As we continue this upward trajectory, we are expanding our team to ensure our products remain reliable, intuitive, and delightful for our ever-growing user base.
Mission
As a DevOps/Infrastructure team member, your primary mission will be to help ensuring that our infrastructure is scalable, reliable, and cost-effective, supporting the company’s rapid growth and evolving needs. You will play a critical role in both day-to-day operations and long-term strategic planning, helping shape our platform's future.
What You’ll Do
- Design and deploy cloud-native systems on AWS, that just work — at scale.
- Own Kubernetes clusters: architecture, automation, observability, and performance tuning.
- Build Terraform modules and GitOps pipelines to make deployments boring (in a good way).
- Create CI/CD pipelines that are fast, reliable, and get new features into production safely.
- Automate everything — provisioning, monitoring, security — with the best tools for the job.
- Push systems to 99.99% uptime across billions of transactions a year.
- Partner directly with product engineers, security, and leadership to ship infrastructure that accelerates growth.
- Mentor teammates, review pull requests, and help define best practices as we scale.
You Should Have
- 7+ years of experience in DevOps, Cloud Engineering, or Site Reliability Engineering.
- Strong Proven track record with LAMP (Linux Apache Mysql PHP)
- Strong real-world experience with AWS, Kubernetes, and Terraform.
- A deep understanding of Linux, containers, networking, and cloud-native security.
- Proven ability to scale systems to handle high-volume, high-availability workloads.
- Hands-on skills in scripting or coding — Python, Go, or Bash are all great.
- An eye for automation and eliminating manual work wherever you find it.
- Startup or growth-stage experience (bonus points).
- Excellent async communication skills (Slack, GitHub, Docs).
Bonus Points If You’ve
- Built infrastructure for AI/ML systems, SaaS, or telco-scale apps.
- Used tools like GitHub Actions, ArgoCD, Vault, Elastic Stack, or EKS.
- Led technical teams or been a mentor for junior engineers.
- Got certifications like AWS Solutions Architect, CKA, or Azure Admin.
About Us
We’re Flowmentum and our clients are fast-moving teams building reliable, scalable, and secure infrastructure for companies shaping the future of AI, fintech, cloud services, and beyond.
Our engineers work on high-traffic, mission-critical systems that power millions of users across the globe.
We believe in autonomy, ownership, and solving hard problems — at scale. If you’re passionate about infrastructure done right, we want you on the team.
What You’ll Do
- Design and deploy cloud-native systems on AWS, that just work — at scale.
- Own Kubernetes clusters: architecture, automation, observability, and performance tuning.
- Build Terraform modules and GitOps pipelines to make deployments boring (in a good way).
- Create CI/CD pipelines that are fast, reliable, and get new features into production safely.
- Automate everything — provisioning, monitoring, security — with the best tools for the job.
- Push systems to 99.99% uptime across billions of transactions a year.
- Partner directly with product engineers, security, and leadership to ship infrastructure that accelerates growth.
- Mentor teammates, review pull requests, and help define best practices as we scale.
You Should Have
- 7+ years of experience in DevOps, Cloud Engineering, or Site Reliability Engineering.
- Strong Proven track record with LAMP (Linux Apache Mysql PHP)
- Strong real-world experience with AWS, Kubernetes, and Terraform.
- A deep understanding of Linux, containers, networking, and cloud-native security.
- Proven ability to scale systems to handle high-volume, high-availability workloads.
- Hands-on skills in scripting or coding — Python, Go, or Bash are all great.
- An eye for automation and eliminating manual work wherever you find it.
- Startup or growth-stage experience (bonus points).
- Excellent async communication skills (Slack, GitHub, Docs).
Bonus Points If You’ve
- Built infrastructure for AI/ML systems, SaaS, or telco-scale apps.
- Used tools like GitHub Actions, ArgoCD, Vault, Elastic Stack, or EKS.
- Led technical teams or been a mentor for junior engineers.
- Got certifications like AWS Solutions Architect, CKA, or Azure Admin.
Ontem
- Employment type: 12-month contract with possibility of renewal depending on business needs and personal performance. Pay in USD.
- Location: Remote in Latin America. Preferred locations include Peru, Mexico, Brazil, Chile, Bolivia, Ecuador, or Colombia.
- Timezone: Must be able to work on US time zone.
About Us
Concord isn't your typical consulting firm; we are an execution company with a passion for making things happen. Our mission is to help clients enhance customer experiences, optimize operations, and revolutionize their product offerings through seamless integration, optimization, and activation of technology and data.
We are purpose-built, merging the industry’s top specialty companies to amplify our Innovation Capabilities in analytics & AI, data management & engineering, UX and digital experience development, and technical platform integration, automation & security engineering.
About The Role
We are seeking a Lead DevOps Engineer to join our client’s Global Business Services team, where the mission is to improve and transform internal technology infrastructure, including IT operations, workplace technologies, cybersecurity, architecture, engineering services, and enterprise systems.
The Lead DevOps Engineer is responsible for designing, building, and optimizing enterprise-scale software systems and cloud platforms. This role combines deep technical expertise with leadership in DevSecOps, platform engineering, and cloud architecture.
You’ll drive innovation, ensure engineering excellence, and lead cross-functional initiatives to deliver secure, scalable, and high-performing solutions. In addition to technical execution, you’ll play a key role in mentoring teams, influencing technical strategy, and integrating emerging technologies such as generative AI into engineering practices.
You’ll report to the Manager of DevSecOps Engineering and work alongside a collaborative team of engineers. As part of a global DevOps community, you’ll join a network of professionals who collaborate, support, and inspire each other while investing in continuous learning and innovation.
What You’ll Do
- Lead the design and engineering of scalable, secure, and resilient enterprise software and cloud platforms.
- Define and implement best practices in software development, DevSecOps, and infrastructure automation.
- Oversee deployment, monitoring, and maintenance of platforms using SRE principles.
- Architect and manage cloud infrastructure (AWS, Azure, GCP) with focus on performance, cost-efficiency, and compliance.
- Lead the development and optimization of CI/CD pipelines and Infrastructure as Code (IaC) practices.
- Guide the adoption of containerized and serverless architectures (Docker, Kubernetes, AKS, Azure Functions).
- Ensure adherence to security standards, conduct risk assessments, and drive compliance initiatives.
- Translate complex business requirements into scalable technical solutions and roadmaps.
- Mentor and coach engineering teams, fostering a culture of collaboration and continuous improvement.
- Explore and integrate generative AI technologies to enhance automation and software capabilities.
What We’re Looking For
- Bachelor’s degree in Computer Science, Information Systems, or related technical field.
- 7+ years of experience in software engineering, DevSecOps, or platform engineering.
- 3+ years in a technical leadership role.
- 2+ years of experience with AI and generative AI technologies, or equivalent exposure.
- Proven background in enterprise-scale application and infrastructure operations.
- Advanced scripting skills (Bash, Python) for automation and infrastructure tasks.
- Deep expertise with programming frameworks (.NET, Java) and modern application development.
- Extensive experience with Infrastructure as Code (IaC) tools such as Terraform, including module development.
- Proficiency with CI/CD tools (Azure DevOps, Jenkins, GitLab CI, GitHub Actions).
- Strong architectural knowledge of cloud platforms (AWS, Azure, GCP).
- Familiarity with SRE practices and tools for monitoring, incident management, and performance tuning.
- Excellent problem-solving and decision-making skills in complex environments.
- Strong communication and collaboration skills, with the ability to influence stakeholders and lead teams.
- Relevant certifications (e.g., Azure/AWS DevOps, PSD) are preferred.
Summary
We’re looking for a DevOps Engineer to help us build and scale the foundation behind cutting-edge immersive entertainment experiences. You will play a key role in shaping our infrastructure-as-code, streamlining deployments, and ensuring our systems are secure, observable, and resilient. If you enjoy working with modern cloud technologies, container orchestration, and automation in a dynamic environment, we’d love to hear from you.
Core Requirements
- Hands-on experience with infrastructure as code using Pulumi (TypeScript).
- Strong understanding of AWS services (IAM, VPC, S3, EC2, EKS, security groups).
- Solid knowledge of Kubernetes fundamentals (pods, services, deployments, ingress).
- Experience with CI/CD pipelines using GitHub Actions.
- Familiarity with containerization using Docker and deployment workflows with Helm, Kustomize, and FluxCD.
- Understanding of networking concepts (DNS, load balancers, ingress controllers) and securely integrating managed services (e.g., MongoDB Atlas) with Kubernetes.
- Proven experience in monitoring and observability with Grafana, Loki, and Prometheus.
Nice to Have
- Experience with build & release management for TypeScript and/or Rust applications.
Other Traits
- Strong problem-solving and troubleshooting skills.
- Ability to work in cross-functional teams with engineers from multiple disciplines.
- Passion for automation, scalability, and infrastructure best practices.
Cargo
Ontem
Job Description
About you
You are someone who wants to influence your own development. You’re looking for a company where you have the opportunity to pursue your interests and be able to grow professionally.
You bring to Applaudo the following competencies:
- Bachelor’s degree in Computer Science, Information Technology, or a related field, or equivalent work experience.
- 3+ years of experience in a DevOps or related role.
- Proficiency in AWS cloud services, specifically IAM, RDS, and EKS.
- Experience with infrastructure as code (IaC) tools, such as Terraform.
- Strong background in setting up and managing CI/CD pipelines.
- Familiarity with monitoring tools, especially Datadog.
- Knowledge of SOC2 compliance standards and security best practices.
- Experience with VPN configuration, Cloud IdP (such as JumpCloud), and network security.
- Certifications in AWS (e.g., AWS Certified DevOps Engineer) or security (e.g., CISSP) are desirable.
- Experience with additional cloud providers (e.g., Azure, GCP) is a plus.
- Proficiency in scripting and automation tools (e.g., Python, Bash) is desirable.
- Familiarity with microservices architecture and container orchestration.
- Advanced english proficiency.
You will be accountable for the following responsibilities:
- Infrastructure Management: Manage both employee and organizational infrastructure to ensure seamless operations.
- VPN Configuration & Management: Configure and manage VPN access to secure connections for employees.
- Cloud Identity Provider (IdP) Management: Oversee JumpCloud or equivalent cloud IDP services for secure user management.
- Employee Onboarding & Offboarding: Manage workstation setups, configurations, and removals during employee transitions.
- Infrastructure as Code (IaC): Implement, manage, and maintain infrastructure as code using Terraform for streamlined infrastructure deployment.
- AWS Account & IAM Management: Oversee AWS IAM roles, permissions, and account setups for both employees and applications.
- Database Cluster Management: Scale and manage database clusters and connectivity using AWS RDS, ensuring high availability and performance.
- Microservices Infrastructure: Maintain and optimize microservices infrastructure on AWS EKS (Elastic Kubernetes Service).
- Monitoring & Alerting: Design and implement robust monitoring and alerting systems with Datadog to ensure proactive issue detection and resolution.
- Continuous Integration/Continuous Deployment (CI/CD): Develop and maintain CI/CD pipelines to support efficient, reliable, and secure deployment processes.
- SOC2 Compliance: Ensure all company data systems meet SOC2 standards and regularly review for compliance adherence.
- Information and Network Security: Establish, enforce, and enhance network security measures, following industry best practices.
- Penetration Testing & Audits: Conduct recurring penetration tests and audits as per company policies, addressing vulnerabilities promptly.
Additional Information
Here at Applaudo Studios values as trust, communication, respect, excellence and team workare our keys to success. We know we are working with the best and thus treat each other with respect and admiration without asking.
Submit your application today, and don't miss this opportunity to join the Best Digital team in the Region!
We truly appreciate all the hard and outstanding work our team makes every day at Applaudo Studios, and that's why the perks that we offer, are deeply thought and designed as a way to thank them for their commitment and excellence.
Some of our perks and benefits:
- Celebrations
- Entertainment area*
- Modern Work Spaces*
- Great work environment
- Private medical insurance*
*Benefits may vary according to your location and/or availability. Request further information when applying.