¡Activa las notificaciones laborales por email!

Data Scientist - Gen AI Specialist

Indegene

Vitoria

Presencial

EUR 40.000 - 70.000

Jornada completa

Ayer
Sé de los primeros/as/es en solicitar esta vacante

Descripción de la vacante

A global consultancy in life sciences is seeking AI Specialists to develop and train Generative AI models, conduct data analysis, and integrate these models with AWS and Snowflake. The ideal candidates will have strong knowledge of machine learning and experience with scalable AI applications. Additional familiarity with various data processing tools is required.

Formación

  • Strong knowledge in machine learning and Generative AI.
  • Experience building scalable AI applications on AWS.
  • Familiarity with OCR tools is beneficial.

Responsabilidades

  • Develop and train Generative AI models.
  • Perform data analysis and prepare data for AI model training.
  • Integrate AI models with Snowflake and AWS.

Conocimientos

Machine learning
Generative AI
AWS Bedrock
Document parsing
NLP techniques
Python for data science

Herramientas

AWS
PyMuPDF
Apache Tika
Tesseract OCR
spaCy
Hugging Face Transformers
Vector databases
Git
Azure DevOps

Descripción del empleo

Who are we?

Indegene is a global consultancy at the forefront of driving innovation in the Pharmaceutical and Life Sciences industry, combining medical and commercial expertise with innovative digital and AI technologies.

We enable global healthcare organizations to address complex challenges and drive better health and business outcomes by seamlessly integrating analytics, technology, operations, and medical expertise. Find out more at indegene.com.

Who are you?

We are seeking experienced AI Specialists to develop and train Generative AI models, perform data analysis, and prepare data for AI model training. You will also integrate AI models with Snowflake, AWS, and other systems.

Required knowledge:

  • Strong knowledge in machine learning and Generative AI, especially content generation using AWS Bedrock and OpenAI models.
  • Experience building scalable AI applications on AWS.
  • Experience with unstructured data processing, document parsing (PDF, Word, HTML) using tools like PyMuPDF, Apache Tika, or Textract.
  • Familiarity with OCR tools such as Tesseract OCR, AWS Textract, or Azure Form Recognizer.
  • Proficiency in NLP techniques using spaCy, NLTK, or Hugging Face Transformers, including NER.
  • Experience with vector databases such as FAISS, Qdrant, Pinecone, Weaviate, or ChromaDB for semantic search and retrieval.
  • Understanding of embedding models like OpenAI, Cohere, or Sentence-BERT.
  • Experience in chunking and indexing large documents, maintaining metadata and document structure.
  • Strong background in vector databases; experience with Snowflake and graph databases is a plus.
  • Knowledge of Agentic AI is a plus.
  • Proficiency in Python for data science, Streamlit for prototypes, Git, and Azure DevOps.
  • Experience working in agile, international environments, with CI/CD pipelines and test-driven development.
  • Good documentation and coaching skills.

We are an Equal Opportunity Employer committed to diversity and inclusion. All qualified applicants will receive consideration without discrimination based on race, religion, sex, age, or other characteristics.

Consigue la evaluación confidencial y gratuita de tu currículum.
o arrastra un archivo en formato PDF, DOC, DOCX, ODT o PAGES de hasta 5 MB.