¡Activa las notificaciones laborales por email!

Senior DevOps Engineer (Remote)

A5 Labs

Málaga

A distancia

USD 100.000 - 130.000

Jornada completa

Hace 2 días
Sé de los primeros/as/es en solicitar esta vacante

Descripción de la vacante

A leading tech company is seeking a Senior DevOps Engineer to design and manage cloud infrastructure for their poker applications. Candidates should have over 7 years of experience in DevOps, strong skills in AWS and Docker, and familiarity with MLOps workflows. The position offers a remote-friendly work culture, unlimited vacation policy, and opportunities to work on cutting-edge AI products.

Servicios

Unlimited vacation policy
Remote-friendly work culture
Opportunities for rapid feedback

Formación

  • 7+ years experience in DevOps or Infrastructure Engineering, including AI/ML workloads.
  • Strong AWS skills with experience in various AWS services.
  • Hands-on experience with Docker and orchestration platforms.

Responsabilidades

  • Design and build cloud infrastructure for poker applications.
  • Automate builds, testing, and deployments.
  • Establish robust MLOps and LLMOps workflows.

Conocimientos

DevOps principles
Cloud computing (AWS, Cloudflare)
CI/CD pipelines
Containerization (Docker)
Infrastructure as Code (Terraform)
MLOps workflows
Security best practices
Observability tools

Herramientas

GitHub Actions
MLflow
SageMaker
Datadog

Descripción del empleo

Senior DevOps Engineer (Remote) Madrid, Spain | Bucharest, Romania | Berlin, Germany | London, United Kingdom of Great Britain and Northern Ireland | Belgrade, Serbia | Barcelona, Spain | Cluj-Napoca, Romania Full Time At A5 Labs, we are committed to creating cutting-edge, AI-driven experiences that redefine industry standards. If you’ve ever played an online casino game, you may have already encountered our technology and innovation. The Role We’re seeking a Senior DevOps Engineer with expertise in MLOps and LLMOps to join our team. In this role, you will help us build and operate the infrastructure behind our poker applications, ensuring that it is secure, scalable, and efficient. You will work closely with product and engineering teams to enable a self-service approach, allowing developers to ship features faster and more reliably. From designing cloud infrastructure to automating deployments and establishing monitoring and incident management practices, you’ll be at the heart of how we scale our platform and teams.

You’ll also play a key role in supporting our MLOps and LLMOps workflows, helping scale AI model deployment and experimentation across our platform.

Key Objectives Design and build cloud infrastructure to run poker applications at scale, optimised for learning and exploration by recreational players worldwide. Optimise development workflows by automating builds, testing, and deployments while ensuring fast, reliable infrastructure to minimise friction and maximise developer focus.

Establish and maintain robust MLOps and LLMOps workflows to support the scalable development, reliable deployment, and continuous optimisation of LLMs at scale.

What you bring to the table

Experience 7+ years in DevOps / Infrastructure Engineering, including AI / ML workloads in production.

Cloud & Efficiency

Strong AWS and Cloudflare skills with hands-on experience in EB, ECS, RDS, MSK / Kinesis, CloudWatch, IAM, Lambda, S3, Route 53, etc., and a proven track record in infrastructure cost optimisation.

Multi-region & Scaling

Experience designing highly available, scalable, multi-region systems with disaster recovery strategies and cost optimisation.

Containerisation & Orchestration

Hands-on experience with Docker and orchestration platforms such as ECS, EKS, or Kubernetes.

Security & Reliability

Good understanding of cloud security best practices to ensure safe and resilient systems.

CI / CD & Observability

Experience with CI / CD pipelines, such as Bitbucket Pipelines or GitHub Actions, and observability tools like OpenTelemetry and Datadog or similar.

Infrastructure as Code

Proficient with Terraform or Pulumi for managing infrastructure.

MLOps & LLMOps

Experience supporting ML workflows and model lifecycle management using tools like MLflow and SageMaker.

Understanding of model versioning, experiment tracking, feature stores, scalable deployment, and challenges around LLM inference, fine-tuning, and performance observability.

Experience setting up incident processes, participating in on-call rotations, and resolving production issues.

Worked closely with engineering teams to build tailored infrastructure, provide reusable blueprints and self-service tooling, and promote DevOps best practices.

What We Offer

A fast-moving environment with minimal bureaucracy and quick decision-making

The opportunity to work on cutting-edge AI products and services

A strong focus on high-quality technical solutions

High autonomy and rapid feedback cycles

A great chance to learn how to play poker

Remote-friendly work culture

Unlimited vacation policy

Close collaboration with engineering teams and meaningful contributions to a shared product vision

This role is part of AceGuardian, a cutting-edge team within A5 Labs. AceGuardian is focused on building advanced AI agents through reinforcement learning, game-solving, fine-tuning, and planning. These AI agents tackle challenges such as anti-cheat detection (including collusion and bots) and optimising gameplay across various games. The team operates in stealth mode and is composed of experts in AI, machine learning, and game development, all working together to revolutionise both gaming and real-world problem-solving. By joining this team, you’ll contribute to innovative projects that push the boundaries of AI in the gaming industry while working alongside some of the brightest minds in the field.

J-18808-Ljbffr

J-18808-Ljbffr

Consigue la evaluación confidencial y gratuita de tu currículum.
o arrastra un archivo en formato PDF, DOC, DOCX, ODT o PAGES de hasta 5 MB.