Site Reliability Engineer (f/m/d) Application Hosting/TOSAAS

IONOS

Karlsruhe

Hybrid

EUR 65,000 - 85,000

Full-time

Posted 30+ days ago

Summary

A leading provider of cloud services is looking for a Site Reliability Engineer (f/m/d) in Karlsruhe. You will be responsible for further developing the product infrastructure and for automation with tools such as Terraform and Kubernetes. Several years of experience in a similar role and good German and English skills are expected. The position offers flexible working hours and numerous development opportunities.

Benefits

Flexible working hours

Subsidized canteen

Modern office space

Employee events

Training and development opportunities

Health offers

Qualifications

Several years of experience as a Site Reliability Engineer or in a similar role.
Very good knowledge of the Linux operating system and Kubernetes.
Experience with Infrastructure as Code, preferably Terraform.
Confident developing in at least one programming language.
Experience with highly available and distributed production environments.

Tasks

Further development of the product infrastructure and integration of new products.
Secure operation of our product platform.
Automation of the infrastructure with tools such as Terraform.
Analysis and resolution of complex problems in distributed systems.
Development and maintenance of monitoring solutions.

Skills

Kubernetes

Terraform

Monitoring and alerting

Python

Tools

Prometheus

Grafana

GitLab CI/CD

ELK Stack

At IONOS, the leading European provider of cloud infrastructure, cloud services and hosting services, you will work together with a wide range of teams. We are characterized by open structures, a friendly working culture and flat hierarchies with a strong team spirit. We firmly believe that work and fun are compatible, and offer you the right environment for this. Our constant growth means that we are always looking for new colleagues. Become part of IONOS and grow with us.

As a Site Reliability Engineer (SRE) in our Application Hosting Team, you will form the technical backbone of our product platform for Managed Nextcloud, IONOS GPT, and other web services that we operate on our Kubernetes platform. Together with experienced colleagues, you will design new services and products that remain high-performing and fail-safe even under the highest loads.

Tasks

Your main area of responsibility will be the further development of our product infrastructure and the integration of new products/web services into our Kubernetes and cloud infrastructure.
You will be responsible for the stable and secure operation of our product platform. Your expertise will be in demand when it comes to in-depth analysis and optimization of our primarily containerized and Kubernetes-based application infrastructure.
You live and breathe automation. Using tools such as Terraform, GitLab CI/CD, and ArgoCD, you will provision and manage our entire infrastructure in a declarative and reproducible manner.
You analyze and resolve complex problems in a distributed system landscape and work on the continuous improvement of our platform.
You develop and maintain our monitoring, logging, and alerting solutions (e.g., with Prometheus, Grafana, ELK Stack) to proactively identify bottlenecks and sources of error.

Qualifications

You have several years of experience as a Site Reliability Engineer or in a related role (Linux System Administrator, Platform Engineer, DevOps Engineer, Full Stack Developer) in a Linux and Kubernetes environment.
You have very good knowledge and several years of experience in using the Linux operating system, container technologies, and specifically Kubernetes.
You have experience with Infrastructure as Code (preferably Terraform), CI/CD pipelines (e.g., GitLab CI/CD or GitHub Actions), and Helm charts.
You are confident in developing in at least one programming or scripting language (e.g., Go, Python, Bash) to solve automation and monitoring tasks.
You have experience operating and troubleshooting highly available and distributed production environments, including monitoring, alerting, and log analysis of distributed applications (e.g., Prometheus, Grafana, Fluentd, ELK, VictoriaMetrics, Icinga).
You have a proactive, solution-oriented, independent way of working and the ability to systematically analyze and sustainably resolve complex technical problems.

Language: Good German and English skills are required.

Location: Berlin or Karlsruhe.

Benefits

Flexible, trust-based working hours.
A subsidized canteen and various free drinks at some locations.
Modern office space with very good transport connections.
Various employee discounts for activities and products.
Employee events such as summer and winter parties, as well as workshops.
Numerous training and development opportunities.
Various health offers, such as sports and health courses.

We value diversity and welcome all applications - regardless of, for example, gender, nationality, ethnic or social origin, religion, disability, age as well as sexual orientation and identity, physical characteristics, marital status or any other irrelevant factor subject to applicable law.

Further information on privacy as part of the application process, including the list of affiliates, can be found here: privacy policy
