¡Activa las notificaciones laborales por email!

Data Engineer Banking

BlueSnap, Inc

Madrid

Presencial

EUR 40.000 - 60.000

Jornada completa

Ayer
Sé de los primeros/as/es en solicitar esta vacante

Mejora tus posibilidades de llegar a la entrevista

Elabora un currículum adaptado a la vacante para tener más posibilidades de triunfar.

Descripción de la vacante

ThetaRay is seeking a Data Engineer to join our team in Madrid. As part of a forward-thinking tech environment, you'll design and optimize data pipelines, utilizing cutting-edge technologies to combat money laundering. We're looking for individuals with strong analytical skills, hands-on experience with Apache Spark, and a commitment to innovative, safe financial solutions.

Formación

  • 2+ years of experience with Apache Spark and SQL.
  • Experience with data transformation and ML feature engineering.
  • Fluent in English and Spanish.

Responsabilidades

  • Implement and maintain data pipeline flows for the ThetaRay system.
  • Collaborate with teams to enhance system functionality.
  • Train customer data scientists to maintain data pipelines.

Conocimientos

Data transformation
Data cleansing
SQL
Apache Spark
Machine Learning
Analytical skills
Python
Version control (GIT)
Collaboration

Educación

BSc degree in Computer Science
Statistics

Herramientas

Apache Hadoop
Hive
Elasticsearch
Docker
Jenkins

Descripción del empleo

ThetaRay is a trailblazer in AI-powered Anti-Money Laundering (AML) solutions, offering cutting-edge technology to fintechs, banks, and regulatory bodies worldwide. Our mission is to enhance trust in financial transactions, ensuring compliant and innovative business growth.

Our technology empowers customers to expand into new markets and introduce groundbreaking products.

Why Join ThetaRay?

At ThetaRay, you'll be part of a dynamic global team committed to redefining the financial services sector through technological innovation. You will contribute to creating safer financial environments and have the opportunity to work with some of the brightest minds in AI, ML, and financial technology. We offer a collaborative, inclusive, and forward-thinking work environment where your ideas and contributions are valued and encouraged.

Join us in our mission to revolutionize the financial world, making it safer and more trustworthy for millions worldwide. Explore exciting career opportunities at ThetaRay – where innovation meets purpose.

We are looking for a Data Engineer to join our growing team of data experts. As a Data Engineer , you will be responsible for designing, implementing, and optimizing data pipeline flows within the ThetaRay system. You will support our data scientists with the implementation of relevant data flows based on their feature designs and construct complex rules to detect money laundering activity.

The ideal candidate has experience in building data pipelines and data transformations and enjoys optimizing data flows and building them from the ground up. They must be self-directed and comfortable supporting multiple production implementations for various use cases.

Responsibilities

  • Implement and maintain data pipeline flows in production within the ThetaRay system based on the data scientist’s design
  • Design and implement solution-based data flows for specific use cases, enabling their applicability within the ThetaRay product
  • Build a Machine Learning data pipeline
  • Create data tools for analytics and data scientist team members to assist in building and optimizing our product into an industry leader
  • Collaborate with product, R&D, data, and analytics experts to enhance system functionality
  • Train customer data scientists and engineers to maintain and amend data pipelines within the product
  • Travel to customer locations domestically and abroad
  • Build and manage technical relationships with customers and partners

Requirements

  • 2+ years of hands-on experience working with Apache Spark
  • Hands-on experience with SQL
  • Experience with version-control tools such as GIT
  • Experience with Apache Hadoop Ecosystem including Hive, Impala, Hue, HDFS, Sqoop, etc.
  • Experience with Python (Pandas)
  • Experience with PySpark / Scala / Java / R
  • Experience with data transformation, validation, cleansing, and ML feature engineering
  • BSc degree or higher in Computer Science, Statistics, Informatics, Information Systems, Engineering, or another quantitative field
  • Experience with big data pipelines, architectures, and datasets is advantageous
  • Strong analytical skills with structured and semi-structured data
  • Ability to support data transformation processes, data structures, metadata, dependencies, and workload management
  • Experience with root cause analysis on data and processes to identify improvements
  • Business-oriented with the ability to work with external customers and cross-functional teams
  • Fluent in English and Spanish, both written and spoken

Nice to have

  • Experience with Linux
  • Experience in building Machine Learning pipelines
  • Experience with Elasticsearch
  • Experience with Zeppelin / Jupyter
  • Experience with workflow automation platforms such as Jenkins or Apache Airflow
  • Experience with Microservices architecture components, including Docker and Kubernetes

J-18808-Ljbffr

Consigue la evaluación confidencial y gratuita de tu currículum.
o arrastra un archivo en formato PDF, DOC, DOCX, ODT o PAGES de hasta 5 MB.