Quantexa Data Engineer

Unison Consulting Pte Ltd

Singapore

On-site

SGD 70,000 - 90,000

Full time

2 days ago

Job summary

A leading consulting firm in Singapore is seeking a talented Data Engineer to enhance their big data solutions. This role requires expertise in Hadoop, Spark, Scala, and Elasticsearch, along with Quantexa certification. The Data Engineer will collaborate with teams to create scalable data processing pipelines, optimize performance, and ensure data integrity. Ideal candidates will have a strong background in programming and containerization technologies, as well as excellent problem-solving skills. This is an excellent opportunity to make a significant impact in a dynamic environment.

Qualifications

Must be a Quantexa-certified data engineer or data architect and proficient with the tool.
Proven experience as a Data Engineer, working with Hadoop, Spark, and data processing technologies.
Strong problem-solving and analytical skills with the ability to debug complex issues.

Responsibilities

Implement data transformation, aggregation, and enrichment processes.
Design, develop, and implement Spark Scala applications and data processing pipelines.
Monitor and analyze job performance metrics, identify bottlenecks, and propose optimizations.

Skills

Hadoop

Scala

Spark

Elasticsearch

DevOps

Data Governance

Performance Tuning

Education

Bachelor's or Master's degree in Computer Science, Software Engineering, or a related field

Tools

OpenShift Container Platform

Kubernetes

Spark SQL

Docker

Jenkins

Quantexa Data Engineer

We are seeking a talented and experienced Data Engineer with expertise in Hadoop, Scala, Spark, Elasticsearch, OpenShift Container Platform (OCP), and DevOps practices to join our team. As a Data Engineer, you will play a crucial role in designing, developing, and optimizing big data solutions using Apache Spark, Scala, and Elasticsearch. You will collaborate with cross-functional teams to build scalable and efficient data processing pipelines and search applications. Knowledge of and experience in the Compliance / AML domain will be a plus. Working experience with the Quantexa tool is a must.

Responsibilities

Implement data transformation, aggregation, and enrichment processes to support various data analytics and machine learning initiatives
Collaborate with cross-functional teams to understand data requirements and translate them into effective data engineering solutions
Design, develop, and implement Spark Scala applications and data processing pipelines to process large volumes of structured and unstructured data
Integrate Elasticsearch with Spark to enable efficient indexing, querying, and retrieval of data
Optimize and tune Spark jobs for performance and scalability, ensuring efficient data processing and indexing in Elasticsearch
Implement data transformations, aggregations, and computations using Spark RDDs, DataFrames, and Datasets, and integrate them with Elasticsearch
Develop and maintain scalable and fault-tolerant Spark applications, adhering to industry best practices and coding standards
Troubleshoot and resolve issues related to data processing, performance, and data quality in the Spark-Elasticsearch integration
Monitor and analyze job performance metrics, identify bottlenecks, and propose optimizations in both Spark and Elasticsearch components
Ensure data quality and integrity throughout the data processing lifecycle
Design and deploy data engineering solutions on OpenShift Container Platform (OCP) using containerization and orchestration techniques
Optimize data engineering workflows for containerized deployment and efficient resource utilization
Collaborate with DevOps teams to streamline deployment processes, implement CI/CD pipelines, and ensure platform stability
Implement data governance practices, data lineage, and metadata management to ensure data accuracy, traceability, and compliance
Monitor and optimize data pipeline performance, troubleshoot issues, and implement necessary enhancements
Implement monitoring and logging mechanisms to ensure the health, availability, and performance of the data infrastructure
Document data engineering processes, workflows, and infrastructure configurations for knowledge sharing and reference

Requirements

Bachelor's or Master's degree in Computer Science, Software Engineering, or a related field
Must be a Quantexa-certified data engineer or data architect and proficient with the tool
Proven experience as a Data Engineer, working with Hadoop, Spark, and data processing technologies in large-scale environments
Proficiency in Scala programming language and familiarity with functional programming concepts
Experience with the Quantexa tool is highly preferred
In-depth understanding of Apache Spark architecture, RDDs, DataFrames, and Spark SQL
Strong expertise in designing and developing data infrastructure using Hadoop, Spark, and related tools (HDFS, Hive, Pig, etc.)
Experience with containerization platforms such as OpenShift Container Platform (OCP) and container orchestration using Kubernetes
Proficiency in programming languages commonly used in data engineering, such as Python, Scala, or Java, and in using them with Spark
Knowledge of DevOps practices, CI/CD pipelines, and infrastructure automation tools (e.g., Docker, Jenkins, Ansible, Bitbucket)
Experience with Grafana, Prometheus, and Splunk will be an added benefit
Experience integrating and working with Elasticsearch for data indexing and search applications
Solid understanding of Elasticsearch data modeling, indexing strategies, and query optimization
Experience with distributed computing, parallel processing, and working with large datasets
Proficient in performance tuning and optimization techniques for Spark applications and Elasticsearch queries
Strong problem-solving and analytical skills with the ability to debug and resolve complex issues
Familiarity with version control systems (e.g., Git) and collaborative development workflows
Excellent communication and teamwork skills with the ability to work effectively in cross-functional teams
Experience with cloud platforms (e.g., AWS, Azure, GCP) and their data services is a plus
