Enable job alerts via email!

Data Engineer

ALLTECH CONSULTING SVC INC

Mississauga

On-site

CAD 80,000 - 120,000

Full time

30+ days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

An established industry player is seeking a skilled Cloud Data Engineer with expertise in Python, Pyspark, and SQL. This role involves building and optimizing data pipelines, designing Spark programs in Databricks, and ensuring data quality for enterprise-level systems. You'll collaborate with business users and analysts, implementing best practices for data management and governance. If you have a passion for cloud data architectures and a track record of successful ETL pipeline development, this opportunity is perfect for you to make a significant impact in a dynamic environment.

Qualifications

  • 5+ years experience in Python and SQL for large datasets.
  • Proficient in developing ETL pipelines using Databricks Pyspark.

Responsibilities

  • Build and optimize data pipelines for efficient data ingestion and transformation.
  • Design and deploy Spark programs in Databricks for data analysis.

Skills

Python
SQL
Pyspark
Data Warehousing
ETL Pipelines
Cloud Data Architecture
Data Modeling
Delta Lake
Event-based Technologies
Cloud Certification
Airflow

Tools

Databricks
SQL Server
Snowflake
Synapse
Big Query
Redshift

Job description

Job Description :

This position is for a Cloud Data engineer with a background in Python, Pyspark, SQL and data warehousing for enterprise level systems. The position calls for someone that is comfortable working with business users along with business analyst expertise.

Major Responsibilities:

  • Build and optimize data pipelines for efficient data ingestion, transformation and loading from various sources while ensuring data quality and integrity.
  • Design, develop, and deploy Spark program in databricks environment to process and analyze large volumes of data.
  • Experience of Delta Lake, DWH, Data Integration, Cloud, Design and Data Modelling.
  • Proficient in developing programs in Python and SQL
  • Experience with Data warehouse Dimensional data modeling.
  • Working with event based/streaming technologies to ingest and process data.
  • Working with structured, semi structured and unstructured data.
  • Optimize Databricks jobs for performance and scalability to handle big data workloads.
  • Monitor and troubleshoot Databricks jobs, identify and resolve issues or bottlenecks.
  • Implement best practices for data management, security, and governance within the Databricks environment. Experience designing and developing Enterprise Data Warehouse solutions.
  • Proficient writing SQL queries and programming including stored procedures and reverse engineering existing process.
  • Perform code reviews to ensure fit to requirements, optimal execution patterns and adherence to established standards.

Skills:

  • 5+ years Python coding experience.
  • 5+ years – SQL Server based development of large datasets
  • 5+ years with Experience with developing and deploying ETL pipelines using Databricks Pyspark.
  • Experience in any cloud data warehouse like Synapse, Big Query, Redshift, Snowflake.
  • Experience in Data warehousing – OLTP, OLAP, Dimensions, Facts, and Data modeling.
  • Previous experience leading an enterprise-wide Cloud Data Platform migration with strong architectural and design skills.
  • Experience with Cloud based data architectures, messaging, and analytics.
  • Cloud certification(s).
  • Any experience with Airflow is a Plus.

Must Have:

  • Snowflake
  • SQL
  • Python
  • ADF
  • PySpark
  • Data Warehouse Concepts
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.