
Ativa os alertas de emprego por e-mail!
Cria um currículo personalizado em poucos minutos
Consegue uma entrevista e ganha mais. Sabe mais
A fast-moving AI company is seeking a Platform Operations Team Lead based in Brazil to oversee the health of AI-powered vision systems. You will lead a team ensuring operational readiness and system reliability across LATAM and European time zones. The ideal candidate will possess strong leadership and technical skills, experience in DevOps, and proficiency in tools like Datadog and AWS. This role offers hybrid and remote work flexibility.
Mindhive builds AI‑powered vision systems that transform industrial production. As we scale globally, reliability, observability, and rapid issue response are critical. The Platform Operations Team Lead (Brazil) plays a central role in ensuring our systems remain healthy across LATAM and European time zones.
This role focuses on :
You will collaborate closely with the NZ‑based Platform Engineering team, who drive deep engineering projects (Puppet 8 rollout, Python 3.13 upgrade, CDK migration, CI / CD consistency, platform hardening, test‑rig reliability). Together, you will form Mindhive’s global reliability backbone.
This is a hands‑on leadership role with high impact on customer experience, system uptime, and our ability to scale installations worldwide.
You are a strong operational leader with deep hands‑on technical skills. You thrive in live production environments, enjoy solving real‑world system issues, and understand how to build reliable systems across time zones.
You excel in :
You care about technical quality, clarity, and people — and you bring a mindset focused on resilience, collaboration, and steady improvement.
Lead and grow the Platform Operations team across Brazil and Portugal.
Build a high‑performing follow‑the‑sun operational capability that supports both internal teams and customers.
Establish clear daily operational rhythms, including alert review, ticket management, and incident response.
Mentor engineers and technicians across Brazil and Portugal.
Create a culture of ownership and continuous improvement.
Ensure communication is clear, predictable, and aligned with our values.
Build a team that is highly accountable, collaborative, and customer‑focused.
Own the quality and accuracy of Datadog dashboards, alerts, service catalog, resource catalog, and operational visibility.
Reduce alert noise, improve signal quality, and ensure teams receive actionable information.
Develop and maintain runbooks, playbooks, and operational documentation.
Oversee first‑line and second‑line incident response during LATAM and EU hours.
Ensure fast, structured triage for issues across cloud, on‑premise, and edge deployments.
Maintain clear escalation paths and strong communication practices during incidents.
Partner with Implementation and Customer Success teams to resolve client‑facing issues.
Act as the operational counterpart to NZ Platform Engineering.
Ensure operational readiness for major engineering initiatives, such as :
Provide field feedback, operational insights, and rollout support for these improvements.
Monitor the health of live systems across sites and proactively identify stability risks.
Help drive improvements in :
Work with teams to reduce operational toil and automate repetitive tasks.
Experience leading distributed teams across multiple time zones.
Excellent communication in English and Portuguese.
Ability to collaborate effectively with engineering, implementation, and customer‑facing teams.
Strong organisational skills with ability to manage competing priorities.
Strong background in DevOps, SRE, or Production Engineering environments.
Hands‑on experience operating hybrid cloud + on‑premise / edge systems.
Proficiency with :
Solid programming skills in Python (TypeScript / JavaScript is a plus).
Understanding of security best practices (identity, access, endpoint, and network security).
Experience running incident response, on‑call processes, or follow‑the‑sun operations.
Proven ability to write and maintain runbooks, playbooks, and operational documentation.
Experience supporting industrial, IoT, or hardware‑integrated systems (ideal).
Mindhive Ltd is a fast‑moving AI company using machine learning and computer vision to reimagine industrial systems. Our products run across cloud, on‑premise, and edge deployments, bringing AI performance and reliability directly to the factory floor.
We care deeply about people, quality, and impact. We work collaboratively, iterate quickly, and tackle meaningful, complex problems.
Mindhive is a New Zealand Hi‑Tech Awards winner, recognised for innovation and impact in software, AI, and advanced manufacturing.
We support hybrid and remote work, with our people distributed across Brazil, Portugal, Italy, Japan and New Zealand. We trust each other to deliver results in ways that suit our lives while maximising our collective impact. We move quickly, adapt fast, and support each other through the ups and downs that come with building something new and meaningful.