LLM Evaluation Specialist – Cultural and Linguistic Alignment - Arabic native speaker

Innodata

Riyadh Region

On-site

SAR 120,000 - 150,000

Full time

Today

Job summary

A leading tech company is seeking linguistically and culturally aware professionals in Riyadh to evaluate and enhance multilingual prompt-response datasets for large language models. Responsibilities include rubric design, translation evaluation, and prompt creation. Ideal candidates will have native Arabic proficiency and experience in LLM evaluation. This role is critical for ensuring cultural relevance in AI outputs.

Qualifications

Deep familiarity with the cultural norms in the corresponding region.
Strong attention to detail with the ability to identify subtle issues.
Experience in content moderation or linguistic QA preferred.

Responsibilities

Update rubric definitions with Arabic-specific examples.
Review prompts translated from English into Arabic.
Rate prompt-response pairs using a standardized evaluation template.

Skills

Native proficiency in Arabic

Attention to detail

Experience in LLM evaluation

Familiarity with cultural norms

Education

Master's degree in a relevant field

Tools

Spreadsheets

Evaluation templates

Overview

We are looking for linguistically and culturally aware professionals to support the evaluation and enhancement of multilingual prompt-response datasets for large language models (LLMs). This role involves rubric design, evaluation of translations and model outputs, prompt creation, and red teaming focused on identifying and surfacing cultural nuances and biases in LLM behaviour.

Responsibilities

Rubric Definition & Prompt Evaluation
Update rubric definitions with Arabic-specific examples to ensure cultural and linguistic relevance.
Identify the need for additional rubrics tailored to specific languages or regional contexts.
Review prompts translated from English into Arabic and revise where translations appear unnatural or inaccurate.
Write thoughtful prompts that test the cultural awareness of LLMs.
Rate prompt-response pairs using a standardized, rubric-based evaluation template and provide detailed justifications to support the ratings.
Document problematic outputs and annotate them with clear explanations of rubric violations or cultural insensitivities.

Required Qualifications

Native proficiency in Arabic and deep familiarity with the cultural norms of the corresponding region.
Experience in LLM evaluation, content moderation, or linguistic QA preferred.
Strong attention to detail with the ability to identify subtle issues in language use, tone, and cultural references.
Comfortable working in spreadsheets and evaluation templates.
Master’s degree in a relevant field.

Preferred Qualifications

Prior experience with prompt engineering or LLM testing.
Familiarity with tools such as Gemini, ChatGPT or similar LLM platforms.
Ability to clearly articulate reasoning behind rubric ratings or prompt edits.
