Enable job alerts via email!

Multimodal Large Model Algorithm Engineer

WECHAT INTERNATIONAL PTE. LTD.

Singapore

On-site

SGD 60,000 - 80,000

Full time

Today

Be an early applicant

Job summary

A leading technology company in Singapore is seeking a talented individual to conduct research and development of multimodal large model technologies. The ideal candidate will have a Master’s degree or higher in fields such as Computer Science or Machine Learning, strong engineering skills, and the ability to work on innovative solutions for complex problems. Familiarity with deep learning frameworks and a solid research background in multimodal understanding are essential for this role.

Qualifications

Master’s degree or higher in Computer Science, Machine Learning, or related fields.
Solid research background in multimodal understanding and familiarity with algorithms.
Proficiency in deep learning frameworks like TensorFlow or PyTorch.

Responsibilities

Conduct research and development of multimodal large model technologies.
Track algorithms in multimodal large models and evaluate these models.

Skills

Deep learning frameworks

Programming languages (C/C++, Java, Python)

Multimodal understanding

Teamwork and communication skills

Technical curiosity

Education

Master’s degree or higher in Computer Science or related fields

Tools

TensorFlow

PyTorch

DeepSpeed

Megatron-LM

Job Description

Conduct research and development of multimodal large model technologies, including cross-modal alignment and multimodal understanding tasks, to build industry-leading multimodal large models.
Continuously track state-of-the art algorithms in multimodal large models, participate in the design, training, optimization, and evaluation of these models, and promote their application in business scenarios.

Job Requirements

Master’s degree or higher in Computer Science, Machine Learning, Artificial Intelligence, Applied Mathematics, or related fields.
Solid research background in multimodal understanding (e.g., natural language processing, computer vision, speech understanding/generation), with familiarity in mainstream models and algorithms such as CLIP, LLaVA, VALL-E, etc.
Proficiency in deep learning frameworks like TensorFlow or PyTorch; knowledge of distributed training frameworks (e.g., DeepSpeed, Megatron-LM) and practical experience in multi-node/multi-GPU distributed training.
Strong engineering skills with proficiency in at least one programming language: C/C++, Java, or Python.
Publication record in top-tier conferences (e.g., ICLR, NeurIPS, CVPR, ICCV, ECCV, ACL, EMNLP) is preferred.
Excellent learning ability, technical curiosity, and strong teamwork and communication skills.

Get your free, confidential resume review.

or drag and drop a PDF, DOC, DOCX, ODT, or PAGES file up to 5MB.