Job Description
Shopware is a leading ecommerce system that enables companies worldwide to scale quickly and efficiently in digital commerce. As a trendsetting open source solution, Shopware is already used by some of the largest European brands, retailers and manufacturers in the B2C and B2B industries.
To turn data into an even stronger growth engine, our
Data & AI Lab builds the cloud-native backbone that powers analytics, machine learning, and AI-driven product features across the company.
As a
Data Engineer (m/f/d) in the
Data Platform & Enablement Team, you will design, build, and operate the data infrastructure that underpins this vision. You will work with a modern
AWS stack, Python/PySpark, and Infrastructure-as-Code to deliver reliable, scalable, real-time data solutions that our data scientists, analysts, and product engineers rely on every day.
This position can be filled
on-site, hybrid, but also full-remote within Germany or specific European countries where we are registered. Your work location will be agreed upon in advance and forms the basis of your employment contract.
These are your tasks:
Build & Evolve the Data Platform
- You design, implement, and operate a cloud-native data architecture on AWS (Kinesis, Glue, ECS, Step Functions, DynamoDB) that serves as the backbone for all “Data & AI” initiatives.
- You create and maintain Terraform modules that turn best practices into reusable building blocks.
Data Integration & APIs
- You develop robust ingestion pipelines for streaming and batch data.
- You craft and document RESTful APIs that expose curated datasets to internal and external consumers.
Data Processing & Quality
- You write efficient data-processing applications in Python and PySpark, ensuring well-tested, version-controlled code.
- You implement automated data-quality checks and observability so that “broken data” never reaches production.
CI/CD & Containerization
- You set up and maintain GitHub Actions pipelines and containerize services with Docker for seamless deployments and rollbacks.
Collaboration & Enablement
- You partner closely with cloud engineers, data scientists, analysts, and product teams to gather requirements and translate them into scalable solutions.
This is what you’ll bring to the table:
- Technical expertise: You have solid programming skills in Python and experience with Infrastructure-as-Code (ideally Terraform).
- Cloud & data services: You also have hands-on experience with AWS, real-time/streaming data processing, and all related AWS services.
- Data mindset: You bring a deep understanding of data modeling, pipelines, and analytical problem-solving.
- PySpark: You are familiar with PySpark for large-scale processing.
- Communication: Very good command of English and the enthusiasm to thrive in an international, fully remote team environment.
- Machine-learning and MLOps curiosity: You are genuinely interested in ML/AI; prior production experience is welcome but not required.
This is what we offer you:
- Company Culture: Open culture with flat hierarchies, where individual initiative is encouraged.
- Employment Contracts: Permanent positions that offer long-term security.
- Flexibility: Flexible working hours and options for mobile work and full-remote contracts.
- Equipment: Freedom to choose your preferred work hardware.
- Onboarding: Well-structured onboarding with support from a personal "buddy."
- Work Environment: An inspiring environment with dedicated colleagues and a dynamic community.
- Development Opportunities: Diverse opportunities for personal growth and development.
- Additional Benefits: Attractive perks such as company pension plans, health programs, and regular team events.
- and much more!
You can get a detailed insight on our career page.
Your personal contact for this position is Yanina Rudenko and is happy to answer any questions you may have!