Daerah Khusus Ibukota Jakarta
On-site
IDR 200.000.000 - 300.000.000
Full time
Job summary
A leading transportation firm in Jakarta is seeking a Data Engineer to design and build data pipelines. The ideal candidate will have expertise in SQL and programming, particularly Python, along with a strong background in data engineering practices. Responsibilities include optimizing query performance and collaborating with analytics teams to fulfill data needs.
Qualifications
- 4+ years of experience in data engineering.
- Strong programming skills in Python.
- Familiarity with cloud infrastructure.
Responsibilities
- Design and build data pipelines from various sources to data warehouses/lakes.
- Develop complex transformations using SQL or programming languages.
- Collaborate with Data Analysts and Data Scientists to meet analytical data needs.
Skills
Advanced SQL
Python Programming
ETL/ELT Concepts
Data Modeling
Data Warehouse
Streaming Systems (Kafka, Pub/Sub)
Education
Bachelor's degree in Computer Science, Data Science, Statistics, or related field
Tools
SQL
Golang/Java
Airflow
Docker/Kubernetes
About the job: Data Engineer
- Design and build data pipelines from various sources to data warehouses/lakes.
- Develop complex transformations using SQL or programming languages (Python).
- Implement data quality practices and error handling in pipelines.
- Optimize query performance and data storage.
- Prepare documentation for data models and system flows.
- Collaborate with Data Analysts and Data Scientists to meet analytical data needs.
Job Requirements
- Bachelor's degree in Computer Science, Data Science, Statistics, or related field.
- 2-4 years of experience in data engineering.
- Proficient in advanced SQL.
- Strong programming skills in Python.
- Experience with Golang/Java is a plus.
- Understanding of ETL/ELT concepts, data modeling, and pipeline orchestration (e.g., Airflow).
- Experience with data warehouses or data lakes.
- Familiarity with streaming systems (Kafka, Pub/Sub) is a plus.
- Knowledge of CI/CD, containerization (Docker/Kubernetes), and cloud infrastructure.
- Understanding of data governance, metadata management, and data quality is an advantage.