Data Engineer

Sealed

México

Remote

USD 60,000 - 90,000

Full-time

Posted yesterday

Vacancy Description

A data engineering firm is seeking two Azure Data Engineers for a master data project in Mexico. One must be a senior lead engineer while the other can be mid to senior level. The role involves building data pipelines, normalizing JSON, and creating combined datasets. Familiarity with Python and various data matching algorithms is essential, along with Azure platform skills. Candidates should be self-starters and able to work independently.

Qualifications

  • Advanced Python programming skills required.
  • Experience with various Data Matching Algorithms.
  • Basic knowledge of Graph and Vector databases.
  • Familiarity with Azure platform services such as Databricks or Fabric.
  • Strong independent problem-solving abilities.

Responsibilities

  • Work on data acquisition into Azure and build data pipelines.
  • Normalize JSON data into relational format.
  • Model and design table structures.
  • Analyze data to identify relationships.
  • Create combined datasets using various matching techniques.
  • Build APIs for data search.
  • Develop a lightweight UI for user functionality.

Skills

Python programming
Data Matching Algorithms
Basic Graph DB / Vector DB knowledge
ML-based solution approach
Azure platform
API programming

Job Description

Must Have

  • Python programming (advanced)
  • Various data matching algorithms (string-similarity-based, distance-based, phonetic)
  • Basic Graph DB / Vector DB knowledge
  • ML-based solution approach to define and refine data matching criteria
  • Azure platform (Databricks, Fabric, OpenAI, etc.)

Good To Have

  • API programming (FastAPI or Flask)

Project Details

  • We need two Azure data engineers for a master data project: one senior lead engineer, close to architect level, and one mid-to-senior engineer.
  • The project involves acquiring data into Azure, building data pipelines, and normalizing JSON data into a relational format.
  • Model and design table structures.
  • Analyze data and identify relationships.
  • Match the data against another dataset to find patterns and relationships.
  • Create combined datasets using various matching and merging techniques and logic.
  • Build APIs to search the data.
  • Build a lightweight UI that gives users search functionality.
  • Above all, we need self-starters who can work independently, figure out solutions on their own, work with the client team, and adjust the approach based on those discussions.

Source
Remotive: easily access active and fully remote job opportunities in Software Development from vetted tech companies.
