Role Summary
Support the organization’s data governance and data quality foundations by working with SQL, Python, and R to cleanse, standardize and prepare data for BI tools (Qlik Sense and Power BI). This role offers hands‑on experience with ETL processes, data mapping, and cross‑system validation, with mentoring from senior team members.
Key Responsibilities
- Data engineering (entry‑level)
- Assist in structuring and normalizing source data for ingestion and analytical use.
- Support the design and maintenance of basic ETL/ELT processes and data pipelines.
- Implement Star Schema elements under guidance (dimensions and fact tables).
- Data mapping and cataloging
- Scan and document databases to map and harmonize information (contribute to data dictionaries and lineage).
- Help identify inconsistencies, duplicates, and basic data quality issues.
- Standardization and cleansing
- Apply established methodologies to standardize and cleanse datasets.
- Perform data cleanup, deduplication, and reconciliation tasks.
- Development and automation
- Develop and maintain scripts to cross‑reference and consolidate data between systems (SQL, Python, R).
- Contribute to simple applications or interfaces that enable controlled data access.
- Validation and quality control
- Validate datasets against other systems or sources and report discrepancies.
- Perform exploratory analysis and prepare concise quality reports for key databases.
- Analytical front‑end preparation
- Create basic master measures and prepare datasets for Qlik Sense and Power BI.
- Work with BI to ensure datasets are ready for visualization and end‑user consumption.
Typical Systems and Data Sources
- Veeva (Approved Email, Visits, Events, etc.)
- Salesforce Marketing Cloud (SFMC)
- Flash for WhatsApp
- Relational databases and internal data warehouses
- Qlik Sense and Power BI for visualization and reporting
Required Skills and Experience
- Basic to intermediate experience in SQL (writing queries, joins, aggregations).
- Programming experience in Python and R for data manipulation and scripting.
- Practical experience developing dashboards and datasets in Qlik Sense and Power BI.
- Familiarity with data cleaning, deduplication, and standardization techniques.
- Understanding of ETL/ELT concepts and dimensional modeling (Star Schema) is a plus.
- Good analytical skills, attention to detail, and ability to work in a team with mentoring.
- Effective communication skills in English (or local language as required).
Preferred (not required)
- Experience with data cataloging or lineage tools.
- Familiarity with marketing/CRM platforms such as Veeva and SFMC.
- Basic knowledge of APIs and data access mechanisms.