Enable job alerts via email!

Data Engineer

CMC-APAC PRIVATE LIMITED

Singapore

On-site

SGD 60,000 - 80,000

Full time

8 days ago

Job summary

A leading data management company in Singapore is seeking a dedicated professional to manage and govern data across various platforms. The ideal candidate will have experience with data lakes, governance processes, and cloud services, as well as programming skills in languages such as Python or Java. Join a dynamic environment to drive data efficiency and reliability.

Qualifications

  • Experience in building and operating large-scale data lakes and warehouses.
  • Advanced working experience with SQL and NoSQL databases.
  • Deep understanding of data governance processes.

Responsibilities

  • Create and manage master records for business entities.
  • Implement data governance processes and quality management.
  • Develop and maintain robust data processing pipelines.

Skills

Data governance
Data processing pipelines
Collaboration with cross-functional teams
Large-scale data lakes and warehouses
Hadoop ecosystem
Data quality management
Cloud services (Azure, GCP, AWS)
SQL and NoSQL databases
Object-oriented programming (Python, Java, Scala)
ETL tools

Tools

Informatica MDM
Talend
Spark
Kafka
Delta Lake
Databricks
Hortonworks Data Platform
Cloudera Data Platform
Talend Big Data
Azure Data Factory
Job description

Job Description & Requirements

Key Responsibilities
  • Create and manage a single master record for each business entity, ensuring data consistency, accuracy, and reliability.
  • Implement data governance processes, including data quality management, data profiling, data remediation, and automated data lineage.
  • Create and maintain multiple robust and high-performance data processing pipelines within Cloud, Private Data Centre, and Hybrid data ecosystems.
  • Assemble large, complex data sets from a wide variety of data sources.
  • Collaborate with Data Scientists, Machine Learning Engineers, Business Analysts, and Business users to derive actionable insights and reliable foresights into customer acquisition, operational efficiency, and other key business performance metrics.
  • Develop, deploy, and maintain multiple microservices, REST APIs, and reporting services.
  • Design and implement internal processes to automate manual workflows, optimize data delivery, and re-design infrastructure for greater scalability.
  • Establish expertise in designing, analyzing, and troubleshooting large-scale distributed systems.
  • Support and work with cross-functional teams in a dynamic environment.
Required Skills & Qualifications
  • Experience building and operating large-scale data lakes and data warehouses.
  • Experience with Hadoop ecosystem and big data tools, including Spark and Kafka.
  • Experience with Master Data Management (MDM) tools and platforms such as Informatica MDM, Talend Data Catalog, Semarchy xDM, IBM PIM & IKC, or Profisee.
  • Experience in data governance, including data quality management, data profiling, data remediation, and automated data lineage.
  • Experience with stream-processing systems including Spark-Streaming.
  • Experience working with Cloud services using one or more Cloud providers such as Azure, GCP, or AWS.
  • Experience with Delta Lake and Databricks.
  • Advanced working experience with relational SQL and NoSQL databases, including Hive, HBase, and Postgres.
  • Deep understanding of SQL and the ability to optimize data queries.
  • Experience with object-oriented/object function scripting languages: Python, Java, Scala, etc.
  • A successful history of manipulating, processing, and extracting value from large, disconnected datasets.
  • Experience applying modern development principles (Scrum, TDD, continuous integration, and code reviews).
  • Proven ability to support and work with cross-functional teams in a dynamic environment.
  • Experience with ETL tools such as Talend Big Data, Azure Data Factory, etc.
  • Experience working with Hortonworks Data Platform or Cloudera Data Platform.
  • Experience with Metadata Management tools.
  • Experience working on projects for Master Data Management.
  • Exposure to Data Governance processes and tools.
  • Proven ability in supporting and working with cross-functional teams in a dynamic environment.
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.