Ativa os alertas de emprego por e-mail!

Multimodal Ai Evaluator

Bebeeevaluations

Teletrabalho

BRL 80.000 - 120.000

Tempo integral

Há 8 dias

Cria um currículo personalizado em poucos minutos

Consegue uma entrevista e ganha mais. Sabe mais

Resumo da oferta

A leading evaluation firm in Brazil is seeking detail-oriented professionals to evaluate AI system outputs in text, image, and video. Responsibilities include assessing quality, providing detailed feedback, and identifying errors in AI responses. Candidates must have strong critical reading skills, excellent English comprehension, and familiarity with large language models. Attention to detail and cultural awareness are also essential. This role plays a crucial part in the development of advanced AI systems.

Qualificações

Strong critical reading and observational skills required.
Ability to provide nuanced judgments clearly.
Excellent English comprehension is essential.

Responsabilidades

Evaluate outputs generated by AI across multiple modalities.
Assess quality against specific project criteria.
Provide detailed feedback and scoring for evaluations.

Conhecimentos

Critical reading skills

Evaluative skills

Attention to detail

English comprehension (CEFR B2 or above)

Familiarity with LLMs and generative AI

Cultural awareness

Job OverviewiMerit seeks detail-oriented and analytically minded professionals to perform highly nuanced evaluations of AI system outputs across different modalities : text, image, video, and multimodal interactions.

Evaluators will assess the accuracy, appropriateness, quality, clarity, and cultural alignment of model outputs against complex guidelines, ensuring that results align with project standards and real-world use cases.

These evaluations will directly inform the development and fine-tuning of advanced large language models (LLMs), vision models (LVMs), and multimodal AI systems.

Key Responsibilities:

Evaluate outputs generated by LLMs across multiple modalities (text, image captions, video descriptions, and multimodal prompts).
Assess quality against project-specific criteria such as correctness, coherence, completeness, style, cultural appropriateness, and safety.
Identify subtle errors, hallucinations, or biases in AI responses.
Apply domain expertise and logical reasoning to resolve ambiguous or unclear outputs.
Provide detailed written feedback, tagging, and scoring of outputs to ensure consistency across the evaluation team.
Escalate unclear cases and contribute to refining evaluation guidelines.
Collaborate with Project Managers and Quality Leads to meet accuracy, reliability, and turnaround benchmarks.

Required Skills & Qualifications:

Strong critical reading, observational, and evaluative skills across different modalities.
Ability to articulate nuanced judgments with precision and clarity.
Excellent English comprehension (CEFR B2 or above); additional languages a plus.
Familiarity with LLMs, generative AI, and multimodal systems.
Strong attention to detail and ability to apply guidelines consistently.
Awareness of cultural and linguistic nuances, including potential bias and harm in AI outputs.
Comfort with evolving workflows, rapid feedback cycles, and complex quality frameworks.

Obtém a tua avaliação gratuita e confidencial do currículo.

ou arrasta um ficheiro em formato PDF, DOC, DOCX, ODT ou PAGES até 5 MB.

Melhores cidades

Melhores empresas

Ofertas populares