Context
We are a fast-growing startup: over the last 18 months, revenue has grown 8x and usage has grown 10x. As we continue this upward trajectory, we are expanding our team to ensure our products remain reliable, intuitive, and delightful for our ever-growing user base.
Mission
As a DevOps/Infrastructure team member, your primary mission will be to help ensure that our infrastructure is scalable, reliable, and cost-effective, supporting the company’s rapid growth and evolving needs. You will play a critical role in both day-to-day operations and long-term strategic planning, helping shape our platform’s future.
Key Responsibilities
- Infrastructure Scaling and Stability: Contribute to the continuous scaling of our infrastructure to handle increasing loads while maintaining stability and performance. This includes designing and implementing robust and scalable architectures, automating deployment processes, and optimizing resource allocation.
- Performance Optimization: Continuously monitor and analyze system performance, identify bottlenecks and areas for improvement, and optimize infrastructure and applications to ensure low-latency, high-throughput operation, particularly for video and audio processing.
- Cost Management: Implement strategies to optimize infrastructure costs without compromising performance, including rightsizing resources, automating scaling policies, and leveraging cloud provider cost-saving mechanisms.
- Enhancing Observability: Improve monitoring and observability capabilities, developing and maintaining dashboards, alerts, and logs that provide actionable insights for the engineering team.
- Collaboration and Support: Work closely with developers and other stakeholders to ensure seamless integration between infrastructure and applications, fostering a culture of shared responsibility for reliability.
- Security and Compliance: Contribute to the security and compliance of our infrastructure by implementing best practices and staying up-to-date with industry standards.
- Innovation and Continuous Improvement: Stay abreast of industry trends and technologies and proactively suggest and implement improvements, with opportunities to experiment with new tools and methodologies.
Who We Are Looking For
We seek a highly skilled and experienced DevOps/Infrastructure Engineer who is passionate about building and maintaining scalable, reliable, and efficient infrastructure. You should have a strong background in managing complex environments across cloud providers and bare-metal servers, and be comfortable working in high-performance environments, proactively identifying and resolving potential issues before they impact the system.
Key Qualifications
- Kubernetes Mastery: Proven experience designing, deploying, and managing Kubernetes clusters in production environments.
- Multi-Cloud and Bare Metal Expertise: Deep understanding of at least one primary cloud provider (AWS, GCP, or Azure) and experience managing infrastructure on bare-metal servers.
- Infrastructure at Scale: Demonstrated ability to architect and manage infrastructure that can scale horizontally and vertically, with experience in distributed systems, load balancing, and automated scaling strategies.
- Cost Optimization & Resource Management: Proficient in monitoring resource usage, identifying inefficiencies, and implementing cost-saving measures.
- Programming and Automation: Proficiency in scripting and automation (e.g., Bash, Python).
- Monitoring and Observability: Strong understanding of monitoring, logging, and observability; experience with tools such as Prometheus, Grafana, ELK/Elastic Stack or equivalent.
- Problem-Solving Mindset: Ability to troubleshoot and continuously improve system reliability, performance, and efficiency.
- Collaboration and Communication: Excellent written and verbal communication skills, able to articulate complex technical concepts to non-technical stakeholders and collaborate cross-functionally.
Bonus Qualifications
- Experience with Media Processing: Familiarity with video and audio processing, streaming, or related technologies.
- Familiarity with Node.js environments and applications at scale.
- Familiarity with configuration management and infrastructure-as-code tools like Ansible, Puppet, or Terraform.
Summary
We’re looking for a DevOps Engineer to help us build and scale the foundation behind cutting-edge immersive entertainment experiences. You will play a key role in shaping our infrastructure-as-code, streamlining deployments, and ensuring our systems are secure, observable, and resilient. If you enjoy working with modern cloud technologies, container orchestration, and automation in a dynamic environment, we’d love to hear from you.
Core Requirements
- Hands-on experience with infrastructure as code using Pulumi (TypeScript).
- Strong understanding of AWS services (IAM, VPC, S3, EC2, EKS, security groups).
- Solid knowledge of Kubernetes fundamentals (pods, services, deployments, ingress).
- Experience with CI/CD pipelines using GitHub Actions.
- Familiarity with containerization using Docker and deployment workflows with Helm, Kustomize, and FluxCD.
- Understanding of networking concepts (DNS, load balancers, ingress controllers) and securely integrating managed services (e.g., MongoDB Atlas) with Kubernetes.
- Proven experience in monitoring and observability with Grafana, Loki, and Prometheus.
Nice to Have
- Experience with build & release management for TypeScript and/or Rust applications.
Other Traits
- Strong problem-solving and troubleshooting skills.
- Ability to work in cross-functional teams with engineers from multiple disciplines.
- Passion for automation, scalability, and infrastructure best practices.
Job Description
About you
You are someone who wants to influence your own development. You’re looking for a company where you have the opportunity to pursue your interests and grow professionally.
You will be accountable for the following responsibilities
- Infrastructure Management: Manage both employee and organizational infrastructure to ensure seamless operations.
- VPN Configuration & Management: Configure and manage VPN access to secure connections for employees.
- Cloud Identity Provider (IdP) Management: Oversee JumpCloud or equivalent cloud IdP services for secure user management.
- Employee Onboarding & Offboarding: Manage workstation setups, configurations, and removals during employee transitions.
- Infrastructure as Code (IaC): Implement, manage, and maintain infrastructure as code using Terraform for streamlined infrastructure deployment.
- AWS Account & IAM Management: Oversee AWS IAM roles, permissions, and account setups for both employees and applications.
- Database Cluster Management: Scale and manage database clusters and connectivity using AWS RDS, ensuring high availability and performance.
- Microservices Infrastructure: Maintain and optimize microservices infrastructure on AWS EKS (Elastic Kubernetes Service).
- Monitoring & Alerting: Design and implement robust monitoring and alerting systems with Datadog to ensure proactive issue detection and resolution.
- Continuous Integration/Continuous Deployment (CI/CD): Develop and maintain CI/CD pipelines to support efficient, reliable, and secure deployment processes.
- SOC2 Compliance: Ensure all company data systems meet SOC2 standards and regularly review for compliance adherence.
- Information and Network Security: Establish, enforce, and enhance network security measures, following industry best practices.
- Penetration Testing & Audits: Conduct recurring penetration tests and audits as per company policies, addressing vulnerabilities promptly.
Additional Information
Here at Applaudo Studios, values such as trust, communication, respect, excellence, and teamwork are our keys to success. We know we are working with the best, and thus treat each other with respect and admiration without being asked.
Submit your application today, and don't miss this opportunity to join the Best Digital team in the Region!
We truly appreciate all the hard and outstanding work our team does every day at Applaudo Studios, and that's why the perks we offer are carefully thought out and designed as a way to thank them for their commitment and excellence.
Some of our perks and benefits:
- Celebrations
- Entertainment area*
- Modern Work Spaces*
- Great work environment
- Private medical insurance*
*Benefits may vary according to your location and/or availability. Request further information when applying.
Role Description
ONLY CANDIDATES FROM BRAZIL!
This is a remote contract role for a DevOps Engineer based in Brazil. The DevOps Engineer will be responsible for managing and automating infrastructure, developing software, maintaining continuous integration pipelines, and performing system administration tasks. This cross-functional role requires hands-on expertise in building, automating, and managing complex cloud environments on Microsoft Azure. The ideal candidate will thrive in a collaborative, agile environment and work closely with analysts, developers, and architects to deliver secure, scalable, and innovative solutions.
Responsibilities
- Design, develop, deploy, and maintain infrastructure, applications, and services on Microsoft Azure.
- Build and optimize DevOps pipelines (CI/CD) using Azure DevOps, leveraging both YAML-based and Classic releases
- Implement Infrastructure as Code (IaC) using ARM, Bicep, and Terraform for repeatable, secure deployments
- Monitor, troubleshoot, and optimize Azure environment performance across compute, networking, storage, and data services
- Implement governance, security, and compliance best practices with services such as Azure Key Vault, Managed Identities, and RBAC
- Support integration and administration of Azure Data Factory, Azure Databricks, Function Apps, Logic Apps and other services
- Develop monitoring, logging, and observability frameworks using Azure Monitor and Application Insights, with a focus on distributed tracing and telemetry
- Collaborate with cross-functional teams to design scalable solutions aligned with enterprise architecture standards
Qualifications
- Bachelor's degree in Computer Science, Information Technology, or related field is a plus
- Strong written, presentation, and verbal communication skills
- 5+ years of experience deploying, configuring, and administering applications and services in Microsoft Azure
- Strong expertise in automation, source control, CI/CD, configuration management, and Azure DevOps practices
- Hands-on experience creating and managing both YAML-based and Classic Azure DevOps pipelines
- Proficiency in Python, PowerShell, and Bash for automation and scripting
- 3+ years of direct experience with Azure cloud resource configuration and administration
- Demonstrated knowledge of monitoring principles and implementation of observability solutions
- Working knowledge of the Azure Data Ecosystem (ADF, Databricks, Synapse, AKS, Service Bus, Event Hubs, Key Vault, Purview, Function Apps, Logic Apps, Azure Storage)
- Previous experience in systems, database, or application administration is highly desirable
Your role and responsibilities:
- Design and develop Database-as-a-Service solutions for MS SQL using .NET, PowerShell, Python, and other automation tools.
- Leverage cutting-edge AWS Cloud technologies to build and innovate database infrastructure.
- Perform performance analysis and tuning for MS SQL environments.
- Automate and streamline database management processes.
- Conduct POC analysis and document complex, business-critical database workflows.
- Lead incident analysis and provide corrective action recommendations.
- Develop and implement database alerting and monitoring solutions.
- Build and maintain cloud infrastructure as part of our database product offerings.
- Own the full DevOps lifecycle: design, code, test, deploy, and monitor.
Requirements and Qualifications
- Bachelor’s degree in Computer Science, Engineering, or related field.
- Deep experience in database engineering and automation.
- Strong experience with AWS, including services like CloudFormation (CFN) and Systems Manager (SSM), and with container technologies.
- Deep expertise in MS SQL administration, including HA, backup/restore, security, and performance tuning.
- Proficiency in PowerShell scripting for Windows environments.
- Familiarity with DevOps frameworks and Agile/Scrum methodologies.
- Excellent communication and documentation skills.
- Advanced English.
Position