Job Search and Career Advice Platform

¡Activa las notificaciones laborales por email!

Senior Site Reliability Engineer

Manychat

Barcelona

Híbrido

EUR 60.000 - 80.000

Jornada completa

Hace 28 días

Genera un currículum adaptado en cuestión de minutos

Consigue la entrevista y gana más. Más información

Descripción de la vacante

A leading Chat Marketing platform in Barcelona is seeking a Senior Site Reliability Engineer. This hybrid role involves managing AWS infrastructure, Kubernetes, and improving services within a collaborative team. Ideal for candidates with strong Linux and cloud expertise, the position offers competitive benefits including health insurance, professional development budget, and flexible working arrangements.

Servicios

Comprehensive health insurance
Professional development budget
Flexible benefits package
Hybrid work
In-office perks
Company-funded sport activities

Formación

  • 5+ years of experience managing Linux in production.
  • Strong experience with Kubernetes, Helm, and Terraform.
  • Solid understanding of networking and cloud security best practices.

Responsabilidades

  • Maintain and harden AWS infrastructure.
  • Operate EKS clusters powering Python-based AI services.
  • Migrate services to Kubernetes using Terraform.

Conocimientos

Linux management
Kubernetes
Terraform
Ansible
Networking
Cloud security
PostgreSQL operations
CI/CD best practices
Communication skills
Descripción del empleo

Manychat is a leading Chat Marketing platform. We help businesses engage with their customers on Instagram, Facebook Messenger, WhatsApp, and Telegram.

Trusted by over 1 million brands in 170+ countries, we’re an official Meta Business Partner, backed by top investors, including Bessemer Venture Partners.

With 200+ teammates across international offices in Barcelona, Austin, Amsterdam, São Paulo, and Yerevan — Manychat helps businesses across the globe improve their ROI and grow faster.

ABOUT THE ROLE

We’re looking for a Senior Site Reliability Engineer who thrives at the crossroads of classic Linux and AWS infrastructure and modern Site Reliability Engineering. This is a high‑impact, hybrid role designed for someone who can manage cloud resources, harden Kubernetes clusters, and shape a more reliable and developer‑friendly platform.

We need you not just to maintain but to rethink and evolve our infrastructure, balancing hands‑on operations with strategic improvements that future‑proof our growing AI product landscape.

WHY THE ROLE IS SPECIAL

You won’t be a cog in a massive SRE org. You’ll be the bridge between Infrastructure and Engineering, shaping how we scale Kubernetes, how we approach platform reliability, and how developers ship fast without fear. You’ll get autonomy, ownership, and a smart, humble team excited to learn with you.

WHAT YOU’LL DO
  • Maintain and harden AWS infrastructure (EC2, ALB/NLB, WAF, IAM, CloudWatch)
  • Operate and evolve our EKS clusters powering Python‑based AI services
  • Migrate existing services to Kubernetes using Terraform and Helm
  • Codify infrastructure with Terraform and manage host‑level automation via Ansible
  • Build and improve CI/CD pipelines with GitHub Actions
  • Own observability efforts: Prometheus, Grafana, alerting, and on‑call readiness
  • Support OS‑level patching, certs, WAF rules, and general infra hygiene
  • Partner with engineers to guide best practices and drive platform reliability
  • Create clean, maintainable infrastructure documentation and playbooks
  • Occasionally support rare off‑hours incidents
QUALIFICATIONS
  • 5+ years of experience managing Linux in production (Ubuntu, Amazon Linux)
  • Strong experience with Kubernetes (ideally EKS), Helm, and Terraform
  • Comfort with running and debugging Python workloads in containers
  • Solid understanding of networking, IAM, and cloud security best practices
  • Hands‑on Nginx experience (Ingress and reverse proxy setups)
  • Excellent communication skills; you can explain complex infra to devs clearly
  • Strong Ansible skills beyond the basics
  • PostgreSQL or Amazon RDS tuning and operations experience
  • Deep understanding of observability tools (Prometheus, Grafana, Loki, etc.)
  • Familiarity with PHP production environments
  • Experience with TDD, CI/CD best practices, and agile development
  • Any previous SRE‑like exposure such as building resilience, automation, or incident tooling
WHAT WE OFFER
  • 💙 Comprehensive health insurance for both you and your family.
  • 📚 Professional development budget for conference tickets, online courses, and other relevant resources to help you grow.
  • 🫶 Flexible benefits package to tailor perks that matters most for you.
  • 🪴 Hybrid work and generous leave options to prioritize your work‑life balance.
  • 🍽️ In‑office perks, including free meals and snacks.
  • 🤝 Company‑funded sport activities, annual offsites, and team‑building events.

Manychat is an Equal Opportunity Employer. We’re committed to building a diverse and inclusive team. We do not discriminate against qualified employees or applicants because of race, color, religion, gender identity, sex, sexual preference, pregnancy, national origin, ancestry, citizenship, age, marital status, physical disability, mental disability, medical condition, military status, or any other characteristic protected by local law or ordinance.

With my application, I accept the ManychatPrivacy Policy.

Interested in building your career at Manychat? Get future opportunities sent straight to your email.

Consigue la evaluación confidencial y gratuita de tu currículum.
o arrastra un archivo en formato PDF, DOC, DOCX, ODT o PAGES de hasta 5 MB.