Job Search and Career Advice Platform

Ativa os alertas de emprego por e-mail!

Multimodal Genai Evaluation Specialist

Bebeeevaluations

Manaus

Presencial

BRL 80.000 - 120.000

Tempo integral

Há 7 dias
Torna-te num dos primeiros candidatos

Cria um currículo personalizado em poucos minutos

Consegue uma entrevista e ganha mais. Sabe mais

Resumo da oferta

A leading technology firm in Brazil is seeking skilled Multimodal GenAI Evaluation Specialists to evaluate the quality and accuracy of model outputs across various formats. The role involves assessing texts, image captions, and videos against defined criteria, identifying biases or errors, and providing detailed feedback. Candidates should demonstrate strong analytical skills and the ability to collaborate effectively with project managers. This position promises a dynamic work environment focused on AI evaluation.

Responsabilidades

  • Evaluate outputs generated by LLMs across multiple modalities.
  • Assess quality against project-specific criteria such as correctness and coherence.
  • Identify subtle errors, hallucinations, or biases in AI responses.
  • Provide detailed written feedback, tagging, and scoring of outputs.
  • Collaborate with Project Managers and Quality Leads.
Descrição da oferta de emprego
Job Overview

iMerit seeks highly skilled Multimodal GenAI Evaluation Specialists to assess the accuracy, appropriateness, quality, clarity, and cultural alignment of model outputs against complex guidelines.

Key Responsibilities
  • Evaluate outputs generated by LLMs across multiple modalities (text, image captions, video descriptions, and multimodal prompts).
  • Assess quality against project-specific criteria such as correctness, coherence, completeness, style, cultural appropriateness, and safety.
  • Identify subtle errors, hallucinations, or biases in AI responses.
  • Apply domain expertise and logical reasoning to resolve ambiguous or unclear outputs.
  • Provide detailed written feedback, tagging, and scoring of outputs to ensure consistency across the evaluation team.
  • Collaborate with Project Managers and Quality Leads to meet accuracy, reliability, and turnaround benchmarks.
Obtém a tua avaliação gratuita e confidencial do currículo.
ou arrasta um ficheiro em formato PDF, DOC, DOCX, ODT ou PAGES até 5 MB.