¡Activa las notificaciones laborales por email!

Senior Data Scientist, Gen AI

Adyen

Valencia

Presencial

EUR 50.000 - 90.000

Jornada completa

Hace 26 días

Mejora tus posibilidades de llegar a la entrevista

Elabora un currículum adaptado a la vacante para tener más posibilidades de triunfar.

Descripción de la vacante

This innovative firm is seeking a skilled Data Scientist to lead the development of Generative AI solutions. In this role, you will harness your expertise in Natural Language Processing and machine learning to create algorithms that drive data products. You will work collaboratively across teams to integrate cutting-edge LLM applications, ensuring efficient data preprocessing and model deployment. If you're passionate about tackling unique technical challenges and driving innovation, this role offers a fantastic opportunity to make a significant impact in a dynamic environment.

Formación

  • 5+ years of experience as a Data Scientist with a focus on NLP.
  • Proven skills in developing and monitoring machine learning algorithms.

Responsabilidades

  • Build and interpret algorithms that power data products using Generative AI.
  • Collaborate with teams to integrate LLM applications across systems.

Conocimientos

Natural Language Processing (NLP)
Machine Learning Algorithms
Python Development
Statistical Modeling
Analytical Thinking
Experimentation and Iterative Development

Educación

Bachelor's Degree in Data Science or related field
Master's Degree in Data Science or related field

Herramientas

Pandas
Numpy
Scikit-learn
PySpark
SQL
Airflow
MLflow
Pytorch
HuggingFace Transformers
Git
Docker
Kubernetes

Descripción del empleo

Adyen provides payments, data, and financial products in a single solution for customers like Meta, Uber, H&M, and Microsoft - making us the financial technology platform of choice. At Adyen, everything we do is engineered for ambition.

For our teams, we create an environment with opportunities for our people to succeed, backed by the culture and support to ensure they are enabled to truly own their careers. We are motivated individuals who tackle unique technical challenges at scale and solve them as a team. Together, we deliver innovative and ethical solutions that help businesses achieve their ambitions faster.

This is Gen AI

Our team's mission is to create a Generative AI platform at Adyen that supports various applications based on LLMs. This involves developing platform-oriented components for deploying an LLM backend within Adyen's GPU cluster in Kubernetes, with features like monitoring, access control, rate limiting, prompt debugging, and experiment tracking. We mainly use Open Source frameworks like HuggingFace and LangChain, and models like Llama or Mixtral. This involves developing platform components, but also delivering on some of the most promising use cases across different areas within the company. Through use cases like support case routing and sentiment analysis, they showcase AI's adaptability across different domains within the organization, revolutionizing workflows and decision-making processes.

What you'll do

  1. You will be responsible for building and interpreting algorithms that power data products at Adyen using Generative AI or NLP techniques. That means leading end-to-end development from prompt engineering, few-shot learning, and fine-tuning self-hosted LLMs if needed.
  2. Work on Natural Language Processing techniques and LLMs to tackle text classification, sentiment analysis, and summarization of Question-Answering retrieval.
  3. Provide technical guidance and mentorship to other data scientists specialized in different domains.
  4. Collaborate with cross-functional teams across Adyen to integrate LLM applications in different systems.
  5. Ensure efficient data preprocessing and ETL pipelines to create features that feed machine learning algorithms during training.
  6. Set up experiments to adjust modeling decisions, perform exploratory analysis, tune hyperparameters, or validate hypothesis selection for the right metric set for each business problem. Report metrics and monitor performance to keep stakeholders updated and ensure a smooth model deployment in production.
  7. Iterate with merchants and product audiences, creating algorithms that power state-of-the-art machine learning-based solutions, and be able to explain the reasons behind the executed inference.

Who you are

  1. You have 5+ years of professional experience as a Data Scientist.
  2. Proven experience developing, training, validating, benchmarking, and monitoring machine learning algorithms, particularly in the natural language processing domain.
  3. Extensive knowledge of machine learning algorithms, including a deep understanding of statistical modeling and Python development tooling and libraries: Pandas, Numpy, Scikit-learn, Pytest, PySpark, SQL, Airflow, and MLflow. Strong experience with machine learning frameworks such as Pytorch and HuggingFace Transformers.
  4. Familiarity with prompt engineering techniques and frameworks like LangChain, LlamaIndex, or DSpy. Good understanding of LLM models, including other components like VectorDBs and document loaders.
  5. Knowledge of version control systems (e.g., Git), C / CD, RESTful APIs, containerized applications (Docker), and microservices deployed in Kubernetes. Adherence to coding best practices, including code reusability, documentation, and testing.
  6. You are an analytical thinker with a knack for understanding operational requirements and converting them into actionable ML solutions.
  7. Proactively taking the lead in projects, from ideation to deployment, while ensuring stakeholder collaboration.
  8. You can communicate complex outcomes with clarity over a wide range of audiences.
  9. We appreciate a forward-thinking mindset driven by experimentation and iterative development. A solid foundation in statistics and mathematics will serve you well in this role.

This role is based out of our Madrid office. We are an office-first company and value in-person collaboration; we do not offer remote-only roles.

Consigue la evaluación confidencial y gratuita de tu currículum.
o arrastra un archivo en formato PDF, DOC, DOCX, ODT o PAGES de hasta 5 MB.