Enable job alerts via email!

Vision-Language AI Engineer for Multimodal Systems

Duncan & Ross

Abu Dhabi

On-site

AED 120,000 - 200,000

Full time

Yesterday

Be an early applicant

Generate a tailored resume in minutes

Land an interview and earn more. Learn more

Job summary

A technology firm in Abu Dhabi seeks an experienced Computer Vision Engineer to develop AI solutions that bridge image understanding and natural language processing. Candidates should have a strong proficiency in Python and frameworks like PyTorch and TensorFlow, with experience in multimodal AI and deep learning. This role offers a competitive salary and opportunities for innovation in AI applications.

Qualifications

3-7 years of experience in computer vision, deep learning or multimodal AI.
Strong proficiency in Python and AI frameworks.
Experience integrating LLMs with vision systems.

Responsibilities

Develop and implement computer vision models.
Integrate vision models with LLMs.
Design AI pipelines for multimodal learning.

Skills

Computer Vision

Deep Learning

Python

Transformers

Multimodal AI

Education

Bachelor's or Master's degree in Computer Science

Tools

TensorFlow

PyTorch

OpenCV

Get your free, confidential resume review.

or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.