Senior DevOps / Site Reliability Engineer (full time)

Divio

Remote

EUR 60,000 - 80,000

Full-time

Posted 15 days ago

Summary

A leading cloud infrastructure company in Germany seeks a Site Reliability Engineer to ensure the reliability, performance, and security of both their multi-cloud PaaS and AWS infrastructure. The ideal candidate will have solid expertise in Docker, AWS, and programming (especially Python and TypeScript). You'll enjoy autonomy in your role and contribute to meaningful changes in a collaborative team setting. This position offers the opportunity to work across various technologies in a dynamic and flexible environment.

Benefits

Flexible working hours
Remote work opportunity
Personal development opportunities

Qualifications

  • Solid expertise in Docker and AWS is essential.
  • Experience with programming, especially in Python and TypeScript, is required.
  • Strong foundational knowledge in Linux, networking, and the TCP/IP stack.

Responsibilities

  • Ensure reliability and performance through incident response and troubleshooting.
  • Implement long-term improvements such as migrations and security enhancements.
  • Assist the internal support team with technical questions and help external teams.

Skills

Docker
AWS
Python
TypeScript
Infrastructure as Code
Linux
Networking
TCP/IP stack

Tools

Ansible
AWS CDK
GitHub Actions

Job description

Passionate about infrastructure and security, your head always in the cloud, and looking for new challenges? You might be a great fit for us at Divio. We’re a small hosting provider, but we punch above our weight with ambitious goals and an infrastructure that keeps things exciting.

What is Divio?

Divio is a remote-first company of 18 passionate people spread across Europe. We’re small, focused, and genuinely into tech. Our main product is a multi-cloud Platform-as-a-Service (PaaS) that simplifies the deployment of Docker-based applications, a great fit for partner agencies and developers who want to move fast without getting lost in DevOps complexity. Feel free to try our Divio Cloud platform if you want to see what we’re building!

In parallel, we manage a large-scale AWS setup for a single enterprise client, powering over 100 highly available websites. These two projects live on opposite ends of the cloud spectrum – one on minimal abstractions, one deep in the AWS ecosystem – and that mix is what keeps our work challenging and fun.

More importantly, we care about how we work. We value curiosity, ownership, and clarity. There’s no heavy hierarchy here – everyone brings ideas, and we build together. We’re looking for someone who’s not just technically sharp but genuinely enjoys solving problems with others.

What tech are we playing with?
Divio Cloud
  • Our custom-built PaaS is based on Docker and Django microservices.
  • Designed to be portable across clouds, it’s built on primitives like VMs, object storage, load balancers, and managed DBs (e.g., EC2, S3, RDS/OpenSearch).
  • Deployed with Ansible reinforced with Python and custom tooling.
  • External tools include Datadog, Redis, Elasticsearch, Nessus, and more.
  • The platform supports any Dockerized app, which means we often get to explore new stacks, languages, and frameworks to help clients.
Enterprise AWS Platform
  • This environment is almost the opposite: purely AWS, highly serverless.
  • Static frontends built with Gatsby and Storyblok, enhanced with Lambda APIs.
  • Infrastructure as Code is written in AWS CDK (TypeScript).
  • Other tools in the mix: GitHub Actions, Cloudflare, DynamoDB, API Gateway, etc.

It’s a landscape that requires flexibility: jumping between stacks, mindsets, and even languages! If you enjoy variety, you’ll thrive here.

What’s the job?

As a Site Reliability Engineer (SRE) at Divio, you’ll take care of the reliability, performance, security, and cost-effectiveness of both infrastructures. You won’t be doing it alone, but you will have autonomy and influence.

Your week-to-week will involve:

  • Keeping the lights on: patching, tuning, incident response, and keeping the stack healthy
  • Pushing long-term improvements: migrations, internal tooling, security hardening, monitoring revamps
  • Shaping the infrastructure: evolving our setup to stay modern, secure, and developer-friendly
  • Helping out: assisting our internal support crew with technical questions or the external dev team with their day-to-day

We aim to balance quick fixes and deep refactors so there’s always something meaningful to work on.

Our workflows:
  • On the Divio Cloud side, we work in 2-week sprints guided by quarterly OKRs. It’s a flexible, engineer-led process where priorities are set together. We keep things lightweight and adaptive, and rotate support/on-call weekly across the team.
  • The Enterprise AWS project follows a more structured 3-week Scrum cycle, with regular planning and retrospectives. It involves tighter coordination with the client and an external development team.

Are you the one?

We work across a wide spectrum of technologies and patterns. If you’re an SRE who:

  • Has solid experience in infrastructure and software (Python and TypeScript are our main tools)
  • Enjoys switching contexts and solving real-world problems
  • Doesn’t mind complexity, and even enjoys taming it
  • Can work independently but wants to build with a team
  • Has ideas and wants a say in how things evolve

Then you’ll probably feel at home with us.

We're not looking for someone who knows everything, just someone who’s curious, reliable, and ready to grow with us.

In summary...

We’re a small team working on interesting infrastructure with a lot of freedom and responsibility. There’s real variety in the work, and plenty of opportunities to learn, improve things, and have an impact, both on the platform and on how we work as a team!

If that sounds like your kind of job, we’d really like to hear from you.

Must have:
  • Excellent command of written and spoken English
  • Solid expertise in Docker, AWS, and cloud-native infrastructure
  • Programming experience, ideally in Python and TypeScript (but Go, Java, etc. also welcome)
  • Experience with configuration management and Infrastructure as Code, ideally Ansible and AWS CDK
  • Strong foundational knowledge (Linux, networking, the TCP/IP stack, load balancing, etc.)
  • A reliable and proactive mindset: you take responsibility and can work independently
  • Comfort with support tasks and professionalism when communicating with clients
Nice to have:
  • Hands-on Linux system administration and tuning experience
  • Familiarity with Django or other Python web frameworks
  • Experience using AWS CDK, specifically with TypeScript
  • Operational knowledge of services like PostgreSQL, Redis, RabbitMQ, Elasticsearch
  • Any experience with the tools and technologies mentioned in our stack