Senior DevOps Engineer – Remote

Replika

London

Remote

GBP 60,000 - 90,000

Full time

2 days ago

Job summary

A leading company in AI companionship is seeking a Senior DevOps Engineer to enhance scalable infrastructure and support machine learning workflows. In this hands-on role, you'll collaborate with teams to ensure platform robustness for millions of users and utilize tools like AWS, Docker, and Kubernetes. Join a remote-first culture that values innovation and a mission-driven approach while contributing to a product that impacts millions.

Benefits

Competitive compensation
Remote work flexibility
Team offsites in various countries
High-responsibility environment

Qualifications

  • 5+ years of hands-on experience in DevOps or cloud infrastructure.
  • Strong expertise in multi-cloud infrastructure, including AWS and GCP.
  • Excellent communication skills in English (B2 or higher preferred).

Responsibilities

  • Design, build, and maintain scalable infrastructure for AI applications.
  • Implement MLOps workflows and deploy machine learning models.
  • Monitor system performance and troubleshoot issues proactively.

Skills

Infrastructure as Code
Cloud Infrastructure
MLOps
Monitoring and Alerting
Containerization

Tools

AWS
GCP
Docker
Kubernetes
MLflow

Job description

An AI companion who is eager to learn and would love to see the world through your eyes. Replika is always ready to chat when you need an empathetic friend.

About Replika

Replika is an AI companion loved by 35M+ users worldwide. We're redefining what it means to connect with technology - emotionally, intelligently, and personally. From mobile to VR, we're building an experience that feels less like software and more like someone who gets you. Our team is mission-first, future-facing, and here to create something wonderful. We value agency, room for magic, and a relentless pursuit of good.

About the Role

We're looking for a Senior DevOps Engineer to join our globally distributed, remote-first team. This is a hands-on, high-impact role for someone who thrives in a fast-paced environment and is passionate about building scalable, reliable, and secure infrastructure for cutting-edge AI applications. You'll work closely with engineering, AI, and analytics teams to ensure our platform is robust, performant, and ready to support millions of users around the world.

What You'll Be Doing

  • Design, build, and maintain scalable infrastructure across cloud, on-premises, and hybrid environments to support our rapidly growing AI platform.
  • Support AI teams and MLOps workflows by implementing specialized tooling, monitoring, and deployment pipelines for machine learning models.
  • Automate deployment, monitoring, and scaling of services using modern DevOps tools and practices across diverse infrastructure environments.
  • Ensure high availability, reliability, and security of production and staging environments in multi-cloud and hybrid setups.
  • Collaborate with AI and backend engineers to streamline CI/CD pipelines optimized for ML workflows and bring new features to production.
  • Monitor system performance and troubleshoot issues proactively, implementing solutions to prevent downtime across distributed infrastructure.
  • Drive infrastructure as code (IaC) initiatives to improve repeatability and reduce manual intervention across all deployment environments.
  • Implement and maintain monitoring, logging, and alerting systems specifically designed for AI workloads and model performance tracking.
  • Participate in on-call rotations and respond to production incidents with a deep understanding of AI system requirements.

Who You Are

  • 5+ years of hands-on experience in DevOps, cloud infrastructure, or site reliability engineering.
  • Strong expertise in multi-cloud and hybrid infrastructure, including AWS, GCP, and on-premises environments.
  • Experience with MLOps tooling such as MLflow, Kubeflow, DataRobot, or similar platforms for ML lifecycle management.
  • Experience with containerization and orchestration (Docker, Kubernetes) specifically for ML workloads and GPU clusters.
  • Deep understanding of CI/CD pipelines for machine learning applications and model deployment automation.
  • Experience with specialized monitoring tools for AI systems including model performance tracking, data drift detection, and ML-specific alerting.
  • Understanding of GPU clusters, HPC environments, and specialized AI hardware deployment and management.
  • Excellent communication skills in English (B2 or higher preferred), with the ability to explain technical concepts to stakeholders.
  • Passion for AI and technology, with deep curiosity about machine learning infrastructure and emerging AI technologies.

Bonus Points

  • Background in supporting data science teams and understanding of ML experimentation workflows.
  • Experience with edge computing and distributed AI inference infrastructure.
  • Previous startup experience building and scaling AI infrastructure from the ground up.
  • Knowledge of AI compliance and governance frameworks for production AI systems.

What You'll Get

  • Competitive compensation
  • A chance to build a product that actually matters to millions of people
  • Freedom to work remotely with a globally distributed team
  • Offsites in different countries with people who actually like each other
  • A trustworthy, high-responsibility environment where your ideas really matter
