Site Reliability Engineer (m/f/d)

Nur für registrierte Mitglieder
Weiterstadt
EUR 45.000 - 80.000
Jobbeschreibung

Your mission

  • Ensure reliability and scalability of Azure-based SaaS, PaaS, and IaaS environments.
  • Design infrastructure as a code (IaC) environment incl. automated deployment, scaling, and monitoring processes using tools such as Terraform or Ansible.
  • Develop and maintain CI/CD pipelines using GitHub Actions and Jenkins.
  • Monitor system health using tools like Azure Monitor, Application Insights, and Log Analytics.
  • Proactively identify and resolve performance bottlenecks, security vulnerabilities, and infrastructure issues incl. utilization of relevant metrics.
  • Implement observability and incident response frameworks with logging, alerting, and tracing solutions.
  • Partner with development teams and architects to improve reliability, deployment efficiency, and system resilience by thorough testing and release procedures.
  • Maintain compliance and security standards following industry best practices and frameworks (ISO 27001, SOC 2, NIST, etc.).
  • Ensure availability and rapid incident response.

What you bring as a perfect candidate

  • You thrive in collaborative environments and value collective success.
  • You approach challenges with a positive mindset and a proactive attitude.
  • You believe technology has the power to improve society and drive positive change for the future.

Your profile

  • Bachelor’s or Master’s degree in Computer Science, Software Engineering, or a related field.
  • 3+ years of experience in a Site Reliability Engineer, DevOps, or Cloud Engineer role.
  • Strong expertise in Microsoft Azure, including Azure Kubernetes Service (AKS), Azure Functions, App Services, Virtual Machines, and Networking.
  • Proficiency in scripting/programming using structured and object-oriented approaches with Python, Java, JavaScript, or Bash.
  • Hands-on experience with Infrastructure as Code (IaC) using Terraform.
  • Experience with CI/CD tools such as GitHub Actions and Jenkins.
  • Knowledge of containerization and orchestration (including Docker, Kubernetes).
  • Experience with distributed storage technologies as well as dynamic resource management frameworks (Kubernetes, AKS) is a plus.
  • Familiarity with monitoring and logging tools (including Azure Monitor, Prometheus, Grafana).
  • Understanding of security best practices for cloud infrastructure and application security.
  • Strong problem-solving skills and ability to work in fast-paced, collaborative environments with a proactive mindset.

What we'll offer

  • Parking space
  • Employee discounts
  • Employee events
  • Dogs welcome
  • Home office
  • Flexible working hours
  • Sports and Fitness Options

Why us?

  • Join a dynamic, motivated team that values collaboration and innovation.
  • Experience the energy and agility of a fast-growing tech startup where every day brings new opportunities.
  • Take ownership of projects and bring your ideas to life, shaping the direction of our work.
  • We are dedicated to supporting your growth with resources and opportunities to advance your role and career.
  • Your work will have a direct, positive impact on both the company and society, contributing to meaningful change.