Job Search and Career Advice Platform

Enable job alerts via email!

NLP Data Engineer: Large-Scale AI Data Pipelines

Institute of Foundation Models

Abu Dhabi

On-site

AED 120,000 - 200,000

Full time

9 days ago

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A leading AI research institute in Abu Dhabi is seeking a Data Engineer specializing in Natural Language Processing. You will gather and prepare datasets to support NLP research, develop web crawling solutions, and implement scalable data pipelines. The ideal candidate has extensive experience in Python and data engineering. This role offers the chance to work alongside world-class researchers on impactful AI projects.

Qualifications

  • Bachelor's degree in a related technical field is required.
  • Master’s degree is preferred.

Responsibilities

  • Rapidly collect and prepare high-quality datasets for NLP research.
  • Develop and maintain web crawling solutions and APIs.
  • Refine outputs from LLMs to generate structured datasets.
  • Implement scalable data pipelines and document methodologies.
  • Collaborate with researchers to ensure data meets quality standards.

Skills

Data engineering
Python
Web crawling
Data processing
SQL
Cloud infrastructure
Data structures
Collaboration

Education

Bachelor's degree in Computer Science, Data Science, Engineering
Master's degree or equivalent experience

Tools

AWS
Spark
Kafka
Kubernetes
Job description
A leading AI research institute in Abu Dhabi is seeking a Data Engineer specializing in Natural Language Processing. You will gather and prepare datasets to support NLP research, develop web crawling solutions, and implement scalable data pipelines. The ideal candidate has extensive experience in Python and data engineering. This role offers the chance to work alongside world-class researchers on impactful AI projects.
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.