Job Search and Career Advice Platform
  • Jobs
  • Headhunters
  • Free resume review
  • About Us
EN
331

Data Scientist jobs in South Africa

Data Engineer

Sand Technologies

South Africa
On-site
ZAR 400,000 - 800,000
30+ days ago
I want to receive the latest job alerts for “Data Scientist” jobs

Senior Data Engineer

NTT

Cape Town
On-site
ZAR 600,000 - 800,000
30+ days ago

Senior Data Engineer

NTT

Johannesburg
Hybrid
ZAR 700,000 - 900,000
30+ days ago

Data Analyst / Data Modeler

Exusia

Johannesburg
Remote
ZAR 600,000 - 800,000
30+ days ago

Senior Data Engineer

NTT Limited

Johannesburg
Hybrid
ZAR 700,000 - 900,000
30+ days ago
discover more jobs illustrationDiscover more opportunities than anywhere else. Find more jobs now

LMA Data Specialist

nedbank

Johannesburg
On-site
ZAR 450,000 - 600,000
30+ days ago

Data Engineer

InfyStrat

Johannesburg
On-site
ZAR 500,000 - 800,000
30+ days ago

Data Engineer

nedbank

Johannesburg
On-site
ZAR 600,000 - 800,000
30+ days ago
HeadhuntersConnect with headhunters to apply for similar jobs

Senior Data Analyst

Kuda

Cape Town
On-site
ZAR 800,000 - 1,200,000
30+ days ago

Software Engineer (Machine Learning)

Takealot Group

South Africa
Hybrid
ZAR 600,000 - 900,000
30+ days ago

Data Engineer- Ab Initio

Fourier IT Innovation (Pty) Ltd

Centurion
On-site
Confidential
30+ days ago

Data Analyst

The Global Talent Co.

Johannesburg
Remote
ZAR 300,000 - 600,000
30+ days ago

Senior Data Analyst

Kuda Technologies Ltd

Cape Town
Hybrid
ZAR 600,000 - 900,000
30+ days ago

Senior Machine Learning Developer

SavageOne Pty Ltd

Johannesburg
On-site
ZAR 800,000 - 1,200,000
30+ days ago

Data Analyst

InfyStrat Software Services

Johannesburg
On-site
ZAR 450,000 - 600,000
30+ days ago

Solar Systems Data Engineer

InspHired Recruitment

Randburg
On-site
ZAR 700,000 - 750,000
30+ days ago

Data Engineer

PBT Group

Johannesburg
On-site
ZAR 600,000 - 800,000
30+ days ago

Data Analyst

InfyStrat

Johannesburg
On-site
ZAR 400,000 - 600,000
30+ days ago

SQL Data Engineer (12 Month Contract) | Centurion

The Recruitment Council

Centurion
On-site
ZAR 600,000 - 900,000
30+ days ago

CLINICAL SCIENTIST - RESEARCH PROFESSOR (AC.05.3)

Wits University

Johannesburg
On-site
ZAR 1,000,000 - 1,500,000
16 days ago

Environmental Scientist

Hire Resolve

Ballito
On-site
ZAR 200,000 - 300,000
19 days ago

Medical Technologist / Medical Laboratory Scientist

JDJ Diagnostics

Potchefstroom
On-site
ZAR 250,000 - 350,000
20 days ago

Lead Researcher

Talent Sam

Gqeberha
On-site
ZAR 500,000 - 600,000
21 days ago

UX Researcher I

LexisNexis

Cape Town
On-site
ZAR 400,000 - 600,000
21 days ago

Ux Researcher I

RELX

Cape Town
Remote
ZAR 350,000 - 500,000
21 days ago

Top job titles:

Online jobsHris jobsFarm jobsSport jobsLogistic Manager jobsOffice jobsResearch jobsSuper Market jobsCar Sales jobsUrgent jobs

Top companies:

Jobs at Rand WaterJobs at MediclinicJobs at ShopriteJobs at MarriottJobs at BmwJobs at Ernst & YoungJobs at MicrosoftJobs at SapJobs at HuaweiJobs at Mastercard

Top cities:

Jobs in JohannesburgJobs in Cape TownJobs in DurbanJobs in PretoriaJobs in Port ElizabethJobs in BloemfonteinJobs in GqeberhaJobs in SowetoJobs in PietermaritzburgJobs in East London

Similar jobs:

Data Analysis jobsData Capture jobsData Entry jobsData Entry Remote jobsOnline Data Entry jobsBiomedical Scientist jobsJunior Data Analyst jobs
Data Engineer
Sand Technologies
South Africa
On-site
ZAR 400 000 - 800 000
Full time
30+ days ago

Job summary

Sand Technologies seeks a Data Engineer to design, build, and maintain scalable data pipelines for analysis and applications. You'll collaborate with cross-functional teams to ensure efficient data processing using big data technologies and cloud platforms. This role requires expertise in data modeling, ETL processes, and a solid understanding of data governance and quality.

Qualifications

  • Proven experience as a Data Engineer, or in a similar role.
  • Strong problem-solving skills to diagnose and resolve data issues.
  • Ability to write clean, scalable, robust code using Python or similar languages.

Responsibilities

  • Design and implement scalable data pipelines for ingesting and processing large volumes of data.
  • Collaborate with data scientists and engineers to define data architectures.
  • Monitor data pipelines, optimize for performance, and troubleshoot issues.

Skills

Data Pipeline Development
Problem-solving
Data Modeling
ETL Processes
Big Data Technologies
Cloud Platforms
Data Quality and Governance
DevOps
Collaboration

Education

Experience as a Data Engineer
Understanding of data engineering principles

Tools

Python
Databricks
SQL
Apache Spark
Kafka
Docker
Job description
About Sand

Sand Technologies is a fast-growing enterprise AI company that solves real-world problems for large blue-chip companies and governments worldwide.

We’re pioneers of meaningful AI: our solutions go far beyond chatbots. We are using data and AI to solve the world’s biggest issues in telecommunications, sustainable water management, energy, healthcare, climate change, smart cities, and other areas that have a real impact on the world. For example, our AI systems help to manage the water supply for the entire city of London. We created the AI algorithms that enabled the 7th largest telecommunications company in the world to plan its network in 300 cities in record time. And we built a digital healthcare system that enables 30m people in a country to get world-class healthcare despite a shortage of doctors.

We’ve grown our revenues by over 500% in the last 12 months while winning prestigious scientific and industry awards for our cutting-edge technology. We’re underpinned by over 300 engineers and scientists working across Africa, Europe, the UK and the US.

ABOUT THE ROLE

Sand Technologies focuses on cutting-edge cloud-based data projects, leveraging tools such as Databricks, DBT, Docker, Python, SQL, and PySpark to name a few. We work across a variety of data architectures such as Data Mesh, lakehouse, data vault and data warehouses. Our data engineers create pipelines that support our data scientists and power our front-end applications. This means we do data-intensive work for both OLTP and OLAP use cases. Our environments are primarily cloud-native spanning AWS, Azure and GCP, but we also work on systems running self-hosted open source services exclusively. We strive towards a strong code-first, data as a product mindset at all times, where testing and reliability with a keen eye on performance is a non-negotiable.

JOB SUMMARY

A Data Engineer, has the primary role of designing, building, and maintaining scalable data pipelines and infrastructure to support data-intensive applications and analytics solutions. They closely collaborate with data scientists, analysts, and software engineers to ensure efficient data processing, storage, and retrieval for business insights and decision-making. From their expertise in data modelling, ETL (Extract, Transform, Load) processes, and big data technologies it becomes possible to develop robust and reliable data solutions.

RESPONSIBILITIES

  1. Data Pipeline Development: Design, implement, and maintain scalable data pipelines for ingesting, processing, and transforming large volumes of data from various sources using tools such as databricks, python and pyspark.
  2. Data Modeling: Design and optimize data models and schemas for efficient storage, retrieval, and analysis of structured and unstructured data.
  3. ETL Processes: Develop and automate ETL workflows to extract data from diverse sources, transform it into usable formats, and load it into data warehouses, data lakes or lakehouses.
  4. Big Data Technologies: Utilize big data technologies such as Spark, Kafka, and Flink for distributed data processing and analytics.
  5. Cloud Platforms: Deploy and manage data solutions on cloud platforms such as AWS, Azure, or Google Cloud Platform (GCP), leveraging cloud-native services for data storage, processing, and analytics.
  6. Data Quality and Governance: Implement data quality checks, validation processes, and data governance policies to ensure accuracy, consistency, and compliance with regulations.
  7. Monitoring, Optimization and Troubleshooting: Monitor data pipelines and infrastructure performance, identify bottlenecks and optimize for scalability, reliability, and cost-efficiency. Troubleshoot and fix data-related issues.
  8. DevOps: Build and maintain basic CI/CD pipelines, commit code to version control and deploy data solutions.
  9. Collaboration: Collaborate with cross-functional teams, including data scientists, analysts, and software engineers, to understand requirements, define data architectures, and deliver data-driven solutions.
  10. Documentation: Create and maintain technical documentation, including data architecture diagrams, ETL workflows, and system documentation, to facilitate understanding and maintainability of data solutions.
  11. Best Practices: Continuously learn and apply best practices in data engineering and cloud computing.

QUALIFICATIONS

  • Proven experience as a Data Engineer, or in a similar role, with hands-on experience building and optimizing data pipelines and infrastructure.
  • Proven experience working with Big Data and tools used to process Big Data
  • Strong problem-solving and analytical skills with the ability to diagnose and resolve complex data-related issues.
  • Solid understanding of data engineering principles and practices.
  • Excellent communication and collaboration skills to work effectively in cross-functional teams and communicate technical concepts to non-technical stakeholders.
  • Ability to adapt to new technologies, tools, and methodologies in a dynamic and fast-paced environment.
  • Ability to write clean, scalable, robust code using python or similar programming languages. Background in software engineering a plus.

DESIRABLE LANGUAGES/TOOLS

  • Proficiency in programming languages such as Python, Java, Scala, or SQL for data manipulation and scripting.
  • Strong understanding of data modelling concepts and techniques, including relational and dimensional modelling.
  • Experience in big data technologies and frameworks such as Databricks, Spark, Kafka, and Flink.
  • Experience in using modern data architectures, such as lakehouse.
  • Experience with CI/CD pipelines and version control systems like Git.
  • Knowledge of ETL tools and technologies such as Apache Airflow, Informatica, or Talend.
  • Knowledge of data governance and best practices in data management.
  • Familiarity with cloud platforms and services such as AWS, Azure, or GCP for deploying and managing data solutions.
  • Strong problem-solving and analytical skills with the ability to diagnose and resolve complex data-related issues.
  • SQL (for database management and querying)
  • Apache Spark (for distributed data processing)
  • Apache Spark Streaming, Kafka or similar (for real-time data streaming)
  • Experience using data tools in at least one cloud service - AWS, Azure or GCP (e.g. S3, EMR, Redshift, Glue, Azure Data Factory, Databricks, BigQuery, Dataflow, Dataproc

Would you like to join us as we work hard, have fun and make history?

  • 1
  • ...
  • 12
  • 13
  • 14

* The salary benchmark is based on the target salaries of market leaders in their relevant sectors. It is intended to serve as a guide to help Premium Members assess open positions and to help in salary negotiations. The salary benchmark is not provided directly by the company, which could be significantly higher or lower.

Job Search and Career Advice Platform

Empoweringjob seekers

Tools
  • Jobs
  • Resume review
  • Headhunters
  • Browse jobs
Company
  • About us
  • Careers at JobLeads
  • Site notice
  • Reviews
Support
  • Help
  • Partner integration
  • ATS Partners
Social
  • YouTube
  • LinkedIn
  • Instagram
  • Facebook
  • Privacy Policy
  • Terms of Use

© JobLeads 2007 - 2025 | All rights reserved