Job Search and Career Advice Platform

Enable job alerts via email!

Data Engineer

United Arab Emirates University

Abu Dhabi Emirate

On-site

AED 120,000 - 200,000

Full time

Yesterday
Be an early applicant

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A prestigious research university in Abu Dhabi is seeking motivated Data Engineers to develop and deploy scalable AI and NLP systems. This role focuses on LLM fine-tuning and collaborating with multidisciplinary teams. Applicants should have a Bachelor's in a related field, strong programming skills in Python, and experience with machine learning frameworks. This position offers an exceptional opportunity to contribute to cutting-edge projects advancing AI capabilities in the UAE.

Qualifications

  • Bachelor’s degree in Computer Science, AI, Data Science, Data Engineering, or a related field.
  • Strong programming skills in Python and experience with data processing libraries.
  • Experience in machine learning model development using frameworks such as TensorFlow or PyTorch.

Responsibilities

  • Design and maintain data pipelines supporting AI/ML projects.
  • Develop and deploy LLMs and NLP pipelines for real-world use cases.
  • Conduct data preprocessing and feature extraction for structured and unstructured data.

Skills

Python programming
Data processing libraries (Pandas, NumPy, PySpark)
Machine learning frameworks (TensorFlow, PyTorch)
NLP and LLM ecosystems
Analytical and problem-solving skills
Strong communication skills

Education

Bachelor’s degree in Computer Science, AI, Data Science or related field
Master’s degree in AI or related field

Tools

AWS
Azure
GCP
Docker
Job description
Job Description:

The Big Data Analytics Center (BIDAC) at UAE University is expanding its research and innovation team to advance the next generation of AI, Machine Learning, and Large Language Model (LLM) applications. The Center plays a national leadership role in AI research, data infrastructure, and digital transformation, supporting major UAE initiatives in smart government, education, and innovation. We are seeking highly motivated Data Engineers who will contribute to developing and deploying scalable AI and NLP systems, with a focus on LLM fine-tuning, optimization, and deployment in secure, domain-specific environments. This is an exceptional opportunity to work in a cutting-edge AI lab within a top research university, collaborating with multidisciplinary teams to deliver impactful projects that advance the UAE’s AI capabilities.

Key Responsibilities:
  • Design, build, and maintain data pipelines, ETL workflows, and databases supporting AI/ML projects.
  • Develop and deploy LLMs and NLP pipelines for real-world use cases (education, government services, healthcare, etc.).
  • Conduct data preprocessing, cleaning, feature extraction, and model training for structured and unstructured data.
  • Fine-tune and optimize foundation models (GPT, LLaMA, Falcon, etc.) using domain-specific datasets.
  • Collaborate with research teams on AI model evaluation, interpretability, and explainability.
  • Integrate AI models with front-end applications, APIs, and databases for operational deployment.
  • Support research publications, grant proposals, and technical documentation under BIDAC initiatives.
Minimum Qualifications:
  • Bachelor’s degree (BSc) in Computer Science, AI, Data Science, Data Engineering, or a related field.
  • Strong programming skills in Python and experience with data processing libraries (Pandas, NumPy, PySpark, etc.).
  • Experience in machine learning model development using frameworks such as TensorFlow, PyTorch, or Scikit-Learn.
  • Solid understanding of data structures, algorithms, and statistical methods.
  • Experience working with databases (SQL/NoSQL) and cloud platforms (AWS, Azure, or GCP).
Experience/Skills:
  • Proficiency in NLP and LLM ecosystems, including tokenization, embeddings, transformers, and model fine-tuning.
  • Familiarity with LangChain, Hugging Face Transformers, and OpenAI / Anthropic / Cohere APIs.
  • Understanding of MLOps, containerization (Docker), and deployment tools (FastAPI, Streamlit, MLflow, etc.).
  • Strong knowledge of data versioning and reproducibility tools (DVC, Git, etc.).
  • Ability to handle large-scale, multi-modal datasets (text, audio, video, sensor data).
  • Excellent analytical and problem-solving skills with a research mindset.
  • Strong written and verbal communication skills for technical and interdisciplinary collaboration.
Preferred Qualifications:
  • Master’s degree in AI, Data Science or a related field.
  • Prior experience in LLM fine-tuning, distillation, or on-premises deployment (e.g., Falcon, LLaMA, Mistral).
  • Experience building knowledge graphs or retrieval-augmented generation (RAG) pipelines.
  • Knowledge of distributed computing (Spark, Dask, Ray) and data lake architectures.
  • Experience integrating AI systems into production-grade web or enterprise applications.
  • Contribution to open-source AI projects, or publications in high-impact AI/ML venues.
  • Understanding of Arabic NLP and bilingual (EN/AR) model development is a plus.

Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.