Site Reliability Engineer, Factory Software Systems (m/w/d) - Gigafactory Berlin-Brandenburg
Join Tesla as a Site Reliability Engineer, Factory Software Systems (m/w/d) at the Gigafactory Berlin-Brandenburg.
About Tesla
Tesla is accelerating the world's transition to sustainable energy. Our innovative strategies and products have been developed and launched on a large scale within a few years. This success relies on speed, innovation, and efficiency.
Gigafactory Berlin is a cornerstone of Tesla's success in Europe. Our employees' passion and engagement drive us toward our goals. We invite you to join and help expand this success story.
The Role
The Core Automation Services (CAS) team at Tesla develops applications to enable manufacturing with a focus on reliability, availability, scalability, speed, and security. Our diverse team includes Controls Automation Engineers, Software Engineers, and other disciplines supporting automated manufacturing processes.
As an SRE on the CAS team, you'll work with infrastructure, systems, and applications that serve as middleware between Programmable Logic Controllers (PLCs) and external systems like Databases, MES, and other services.
Responsibilities
- Support and enhance the interim HMI/SCADA vendor application (Ignition from Inductive Automation).
- Build tooling around it, evaluate its usage, and ensure its reliability, availability, and security.
- Design software and systems to enable automated manufacturing at Tesla.
- Assist Engineers in onboarding and integrating services into the Tesla stack (Kubernetes/VMWare/Bare-metal).
- Implement best practices for service observability, including metrics, logging, tracing, and alerting.
- Automate configuration and deployment of services.
- Design infrastructure, systems, and application architecture.
Qualifications
- Experience with Virtualization (vSphere) and/or Containerization (Kubernetes).
- Proficiency in Linux administration (Ubuntu 18.04/20.04).
- Understanding of networking concepts (Routing/Switching, VLANs, Firewalls, Load Balancers).
- Experience in high-level languages such as Go, Python, and/or Java.
- Knowledge of observability tools (Prometheus, AlertManager, Grafana, Jaeger, Splunk).
- Experience with Infrastructure as Code (Terraform/Ansible).
- CI/CD pipeline experience (GitHub Enterprise).
- Artifact management experience (Artifactory).
- Bare-metal imaging experience (Ubuntu netboot PXE).
- Strong proactive approach, willingness to get hands dirty and learn from mistakes.
- Hands-on experience as a DevOps/Site Reliability Engineer.
- Excellent documentation and knowledge sharing skills.
- Willingness to mentor team members with less experience.
- Comfortable on an on-call rotation and troubleshooting live issues.
What We Offer
Work at our state-of-the-art Gigafactory, solving some of the world's most interesting problems alongside passionate colleagues. Tesla offers a competitive salary, shares or bonuses, 30 vacation days, pension, insurance, free EV charging, product discounts, and transportation benefits.
Additional Details
- Seniority level: Mid-Senior level
- Employment type: Full-time
- Job functions: Engineering and IT
- Industries: Automotive, Renewable Energy, Semiconductor, Utilities
Referrals can double your chances of interview success. Get notified about new Site Reliability Engineer positions in Grünheide, Brandenburg, Germany.