Enable job alerts via email!

Data Engineer MidSenior

Yassir

Cape Town

On-site

ZAR 480,000 - 720,000

Full time

18 days ago

Boost your interview chances

Create a job specific, tailored resume for higher success rate.

Job summary

A leading super app company, Yassir, is seeking a Data Engineer to build and optimize data pipelines and integrate various data solutions. This role involves collaborating with cross-functional teams to ensure data accuracy and quality, and leveraging Google Cloud Platform services to enhance analytics capabilities. Join us in transforming the digital landscape and creating impactful solutions for users.

Qualifications

  • Experience with data pipelines and ETL processes.
  • Strong skills in data validation and quality checks.
  • Familiarity with data governance and modelling.

Responsibilities

  • Build centralized data lakes and data processing pipelines.
  • Collaborate with Data Science teams for advanced analytics.
  • Design and manage ETL/ELT processes.

Skills

PySpark
GCP Big Query
Dataproc
Dataflow
Dataplex
PubSub
Cloud Storage
Advanced SQL
NoSQL
Scala
Python
Airflow

Tools

Terraform
Docker
Kubernetes
Looker Studio

Job description

Yassir is the leading super App in the Maghreb region set to changing the way daily services are provided. It currently operates in 45 cities across Algeria Morocco and Tunisia with recent expansions into France Canada and SubSaharan Africa . It is backed $200M in funding by VCs from Silicon Valley Europe and other parts of the world.

We offer ondemand services such as ridehailing and lastmile delivery. Building on this infrastructure we are now introducing financial services to help our users pay save and borrow digitally.

Helping usher the continent into a digital economy era. Were not just about serving people were about creating a marketplace to bring people what they need while infusing social values.

What youll do

  • Build a centralized data lake on GCP Data services by integrating diverse data sources throughout the enterprise.
  • Develop maintain and optimize SPARKpowered batch and streaming data processing pipelines. Leverage GCP data services for complex data engineering tasks and ensure smooth integration with other platform components
  • Design and implement data validation and quality checks to ensure datas accuracy completeness and consistency as it flows through the pipelines.
  • Work with the Data Science and Machine Learning teams to engage in advanced analytics.
  • Collaborate with crossfunctional teams including data analysts business users and operational and marketing teams to extract insights and value from data.
  • Collaborate with the product team to design implement and maintain the data models for analytical use cases.
  • Design develop and upkeep data dashboards for various teams using Looker Studio.
  • Engage in technology explorations research and development POCs and conduct deep investigations and troubleshooting.
  • Design and manage ETL / ELT processes ensuring data integrity availability and performance.
  • Troubleshoot data issues and conduct root cause analysis when reporting data is in question.

Required Technical Skills

  • PySpark
  • GCP Big Query Dataproc Dataflow Dataplex PubSub and Cloud Storage
  • Advanced SQL knowledge
  • NoSQL (Preferably MongoDB)
  • Programming languages Scala / Python
  • Familiarity with workflow management tools like Airflow Prefect or Luigi.
  • Understanding of Data Governance DWH and Data Modelling

Good to have skills

  • Infrastructure as Code Terraform
  • Docker and Kubernetes
  • Looker Studio
  • AI and ML engineering knowledge

At Yassir we believe in the power of diversity and the importance of an inclusive culture. So if youre ready to bring your unique perspective and experiences to the table then were excited to listen.

Dont just apply for a job come and be a part of our journey. Lets create a better tomorrow together.

We look forward to receiving your application!

Best of luck

Key Skills

Apache Hive,S3,Hadoop,Redshift,Spark,AWS,Apache Pig,NoSQL,Big Data,Data Warehouse,Kafka,Scala

Employment Type : Full-Time

Experience : years

Vacancy : 1

Create a job alert for this search
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.