¡Activa las notificaciones laborales por email!

Senior (Big) Data Engineer

Ll Oefentherapie

México

Presencial

MXN 600,000 - 900,000

Jornada completa

Hace 12 días

Genera un currículum adaptado en cuestión de minutos

Consigue la entrevista y gana más. Más información

Descripción de la vacante

A data solutions company in Mexico is seeking a skilled individual to architect and optimize big data solutions. The ideal candidate will have proficiency in Python or Java, experience with Apache Spark, and hands-on expertise in data modeling and ETL processes. This role involves developing scalable solutions and data integration from various sources into centralized repositories.

Formación

Proficiency in Python, Java, or Scala for big data processing.
Strong expertise in big data frameworks like Apache Spark, Hadoop, Hive.
Hands-on experience with data modeling and ETL/ELT development.

Responsabilidades

Architect, design, and optimize scalable big data solutions.
Develop and maintain ETL/ELT pipelines.
Integrate data from diverse sources into centralized repositories.

Conocimientos

Python

Java

Scala

Apache Spark

Hadoop

Hive

Apache NiFi

REST APIs

SQL

Herramientas

Informatica

Talend

Docker

Kubernetes

Key responsibilities

Architect, design, and optimize scalable big data solutions for batch and real-time processing.
Develop and maintain ETL/ELT pipelines to ingest, transform, and synchronize data from diverse sources.
Integrate data from cloud applications, on-prem systems, APIs, and streaming workspaces into centralized data repositories.
Implement and managedata lakes anddata warehouses solutions on cloud infrastructure.
Ensuredata consistency, quality, and compliance with governance and security standards.
Collaborate with data architects, data engineers, and business stakeholders to align integration solutions with organizational needs.

Core qualifications

Proficiency in Python, Java, or Scala for big data processing.
Big Data Frameworks: Strong expertise in Apache Spark, Hadoop, Hive, Flink, or Kafka.
Hands‑on experience with data modeling, data lakes (Delta Lake, Iceberg, Hudi), and data warehouses (Snowflake, Redshift, BigQuery).
ETL/ELT Development: Expertise with tools like Informatica, Talend, SSIS, Apache NiFi, dbt, or custom Python‑based frameworks.
APIs & Integration: Strong hands‑on experience with REST, SOAP, GraphQL APIs, and integration platforms (MuleSoft, Dell Boomi, SnapLogic).
Data Pipelines: Proficiency in batch and real‑time integration (Kafka, AWS Kinesis/ Azure Event Hub/ GCP Pub/Sub).
Databases: Deep knowledge of SQL (Oracle, PostgreSQL, SQL Server) and NoSQL (MongoDB, Cassandra, DynamoDB) systems.

Preferred experience

Expertise with at least one major cloud platform (AWS, Azure, GCP).
Experience with data services such as AWS EMR/Glue, GCP Dataflow/Dataproc, or Azure Data Factory.
Familiarity with containerization (Docker) and orchestration (Kubernetes).
Knowledge of CI/CD pipelines for data engineering.
Experience with OCI and Oracle Database (including JSON/REST, sharding) and/or Oracle microservices tooling.

How we’ll assess

Systems design interview: architect a scalable service; justify data models, caching, and failure handling.
Coding exercise: implement and optimize a core algorithm/data‑structure problem; discuss trade‑offs.
Code review: evaluate readability, testing, error handling, and security considerations.
Practical discussion: walk through a past end‑to‑end project, metrics/SLOs, incidents, and learnings.

Consigue la evaluación confidencial y gratuita de tu currículum.

o arrastra un archivo en formato PDF, DOC, DOCX, ODT o PAGES de hasta 5 MB.