Enable job alerts via email!

Vision-Language AI Engineer for Multimodal Systems

Duncan & Ross

Abu Dhabi

On-site

AED 120,000 - 200,000

Full time

Yesterday
Be an early applicant

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A technology firm in Abu Dhabi seeks an experienced Computer Vision Engineer to develop AI solutions that bridge image understanding and natural language processing. Candidates should have a strong proficiency in Python and frameworks like PyTorch and TensorFlow, with experience in multimodal AI and deep learning. This role offers a competitive salary and opportunities for innovation in AI applications.

Qualifications

  • 3-7 years of experience in computer vision, deep learning or multimodal AI.
  • Strong proficiency in Python and AI frameworks.
  • Experience integrating LLMs with vision systems.

Responsibilities

  • Develop and implement computer vision models.
  • Integrate vision models with LLMs.
  • Design AI pipelines for multimodal learning.

Skills

Computer Vision
Deep Learning
Python
Transformers
Multimodal AI

Education

Bachelor's or Master's degree in Computer Science

Tools

TensorFlow
PyTorch
OpenCV
Job description
A technology firm in Abu Dhabi seeks an experienced Computer Vision Engineer to develop AI solutions that bridge image understanding and natural language processing. Candidates should have a strong proficiency in Python and frameworks like PyTorch and TensorFlow, with experience in multimodal AI and deep learning. This role offers a competitive salary and opportunities for innovation in AI applications.
Get your free, confidential resume review.
or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.