
Databricks Data Engineer

Globalsource It

Remote

BRL 50,000 - 70,000

Full-time

Posted 2 days ago

Job Summary

A data-focused company is seeking a hands-on Databricks Data Engineer for a fully remote contract role. The ideal candidate will have strong experience building and optimizing data pipelines using Databricks, Spark, and SQL, while focusing on ingesting and transforming data for analytics. Responsibilities include creating curated datasets, delivering insights via Power BI, and ensuring data quality and accuracy. This position offers an excellent opportunity to work in a dynamic and remote environment.

Qualifications

  • Strong hands-on Databricks experience with Spark, PySpark, and Delta Lake.
  • Advanced skills in SQL for large-scale data processing.
  • Solid understanding of performance tuning and debugging of Spark jobs.

Responsibilities

  • Build and optimize ETL/ELT pipelines using Databricks.
  • Ingest and transform data from various sources.
  • Design datasets and deliver insights through Power BI.

Skills

  • Databricks experience
  • Advanced SQL
  • Distributed computing understanding
  • Power BI
  • CI/CD pipelines in Azure

Tools

  • Spark
  • PySpark
  • Delta Lake
  • Apache Airflow
  • SAP

Job Description
Databricks Data Engineer – Fully Remote Contract

We're looking for a hands‑on Databricks Data Engineer with strong experience building scalable data pipelines using Spark, PySpark, SQL, and Delta Lake.

This role focuses on ingesting data from multiple sources, transforming it for analytics, and publishing high‑quality datasets and visualizations.
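
By way of illustration only, here is a minimal PySpark sketch of the ingest-transform-publish loop this role describes. The path, table name, and columns below are hypothetical placeholders, not details from the posting:

    from pyspark.sql import SparkSession, functions as F

    # On Databricks a `spark` session already exists; this keeps the
    # sketch runnable elsewhere too.
    spark = SparkSession.builder.appName("orders-ingest").getOrCreate()

    # Ingest: raw JSON landed in cloud storage (hypothetical path).
    raw = spark.read.json("/mnt/raw/orders/")

    # Transform: fix types, derive a partition column, drop bad rows.
    orders = (
        raw
        .withColumn("order_ts", F.to_timestamp("order_ts"))
        .withColumn("order_date", F.to_date("order_ts"))
        .withColumn("amount", F.col("amount").cast("decimal(18,2)"))
        .where(F.col("order_id").isNotNull())
    )

    # Publish: write a curated Delta table for Power BI to read.
    (orders.write.format("delta")
        .mode("overwrite")
        .partitionBy("order_date")
        .saveAsTable("analytics.curated_orders"))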

Responsibilities
  • Build and optimize ETL/ELT pipelines in Databricks using PySpark, Spark SQL, and Delta Lake.
  • Ingest, clean, and transform data from diverse sources (APIs, SQL databases, cloud storage, SAP / legacy systems, streaming).
  • Develop reusable pipeline frameworks, data validation logic, and performance‑tuned transformations.
  • Design curated datasets and deliver insights through Power BI dashboards.
  • Implement best practices for lakehouse development, orchestration, and version control.
  • Troubleshoot pipeline performance issues and ensure data accuracy, reliability, and quality.
Required Skills
  • Strong hands‑on Databricks experience (Spark, PySpark, Delta Lake).
  • Advanced SQL for large‑scale data processing.
  • Experience integrating data from multiple structured and unstructured sources.
  • Solid understanding of distributed computing, performance tuning, and debugging Spark jobs (see the sketch after this list).
  • Power BI (reports, models, DAX preferred).
  • Experience with CI/CD pipelines in an Azure environment.
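
Since tuning and debugging Spark jobs features prominently above, here is a small hypothetical sketch of two routine techniques: reading the physical plan before changing anything, and broadcasting a small dimension table to avoid a shuffle join. Table names are placeholders, not from the posting:

    from pyspark.sql import SparkSession, functions as F
    from pyspark.sql.functions import broadcast

    spark = SparkSession.builder.appName("tuning-demo").getOrCreate()

    orders = spark.table("analytics.curated_orders")  # hypothetical tables
    dim = spark.table("analytics.dim_customer")

    # Step 1: inspect the plan to find shuffle-heavy stages and full scans.
    agg = orders.groupBy("customer_id").agg(F.sum("amount").alias("total"))
    agg.explain(mode="formatted")

    # Step 2: broadcast the small side of a join so Spark uses a
    # broadcast hash join instead of shuffling both tables.
    joined = orders.join(broadcast(dim), "customer_id")
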
Nice to Have
  • Experience with data quality frameworks, Lakehouse monitoring, or DQX.
  • Knowledge of Airflow, ADF, IoT, Kafka or other tools is a plus.
  • Experience with SAP data is a plus.