Agile Robots SE is seeking a Senior System Administrator to operate and evolve our global on-premises compute, operating systems, storage, backups, and identity services. You will engineer and run the foundational server platform across multiple regions and data centers, keeping it hardened, patch-compliant, recoverable, and ready for our applications and data platforms.
Your Responsibilities
- Compute & OS Lifecycle
Operate and optimize hypervisors and clusters (VMware/KVM/Proxmox), including HA, DRS/placement, and firmware/BIOS lifecycle
Provision and maintain Linux and Windows servers with clear baselines, SLAs, and change control
Enforce patching windows and automate OS updates with maintenance/rollback plans
- Provisioning, Imaging & Configuration Management
Deliver repeatable provisioning: PXE/iPXE workflows, golden images with Packer, and unattended installs
Implement configuration management at scale (Ansible/Salt; WSUS/Chocolatey; Red Hat Satellite) with drift detection and remediation
Maintain hardened images aligned to CIS/STIG; document MOP/SOP and rollback for all changes
- Storage, Backup & Disaster Recovery
Operate block/file/object storage (SAN/NAS/Ceph/ZFS): performance tuning, quotas, snapshots/replication
Ensure backup coverage and restorability with Veeam (policies, immutability where applicable, app-aware backups)
Conduct scheduled restore tests; document and meet RPO/RTO; participate in DR exercises
- Directory, Identity & Access (optional specialization)
Administer AD/LDAP/FreeIPA and federation/SSO (Keycloak/MFA), service accounts, and RBAC guardrails
Automate join/leave/move flows (SCIM/connectors), GPO/policy hygiene, and privileged access controls
Integrate identity with platforms (e.g., Linux/Windows domain join, sudo/rights delegation)
- Server Security Hygiene & Logging
Apply CIS baselines and remediate OS/middleware vulnerabilities within agreed SLAs
Manage certificate/PKI lifecycle (issuance, rotation, ACME automation, expiry alerting)
Operate log shipping (rsyslog/agents) to ELK/OpenSearch; ensure audit coverage for privileged actions
- Reliability, Observability & Lifecycle Management
Build availability patterns (dual-homing, LACP/MLAG where applicable, fast recovery) for critical services
Implement monitoring/alerting (e.g., Zabbix), telemetry/SNMP, and capacity trending
Keep the CMDB accurate (CIs, owners, relationships), track EOL/EOS, and produce capacity/cost reports
Use Git-based workflows to version playbooks/configs and deliver safe, repeatable changes
Your Qualifications
- 7+ years administering enterprise Linux/Windows in multi-site environments, including change and incident workflows
- Hands-on experience with at least two hypervisor stacks (e.g., VMware + Proxmox/KVM) and cluster operations (HA, live migration, storage moves)
- Strong skills in provisioning and configuration at scale: PXE/iPXE, Packer, Ansible/Salt; WSUS/Chocolatey or Satellite
- Storage fundamentals (iSCSI/FC/NFS/SMB) with practical Ceph or ZFS experience (pool design, snapshots, replication)
- Veeam at scale: policies, proxies/repos, application-aware backups, restore testing, and RPO/RTO discipline
- Directory/IAM administration: AD (GPO, DNS, Sites & Services) plus FreeIPA/Keycloak and SSO/MFA patterns
- Security hygiene and logging: CIS alignment, vulnerability remediation, certificate lifecycle, rsyslog → ELK/OpenSearch
- Scripting/automation (Bash/PowerShell; Python a plus) and Git-based change workflows
Beneficial Skills
- ZFS send/receive; Ceph RBD/RGW tuning; storage performance troubleshooting
- Immutable/air-gapped backups; off-site replication; DR orchestration
- Exposure to Kubernetes underlay needs (storage classes/CSI, backup hooks like Velero/restic)
- Experience with CMDB tooling (e.g., NetBox/ServiceNow/JIRA CMDB) and asset discovery
- Familiarity with syslog retention strategies, Zabbix tuning, and ITSM/change tooling
What we offer
- A dynamic high-tech company with sound finances and world-class investors
- An interdisciplinary, international team of 58+ nationalities in a collaborative work environment
- Many development opportunities as we continue to grow
- Challenging tasks and impactful projects alongside experts, enabling professional and personal growth
- Corporate benefits program covering health, mobility, and learning, worth €100 net per month
- Modern offices with a rooftop terrace overlooking Munich, free drinks and fruit, and regular company events
Agile Robots SE is an international high-tech company based in Munich, Germany, with a production site in Kaufbeuren and more than 2,300 employees worldwide. Our mission is to bridge the gap between artificial intelligence and robotics by developing systems that combine state-of-the-art force-moment sensing with world-leading image-processing technology. This unique combination of technologies allows us to provide user-friendly and affordable robotic solutions that enable intelligent precision assembly.
This is made possible by our employees, who make the most of every day with creativity and enthusiasm. Become part of this team and shape the future of robotics with us!
We are proud of our diversity and welcome your application regardless of gender and sexual identity, nationality, ethnicity, religion, age, or disability.