
Enable job alerts via email!
Generate a tailored resume in minutes
Land an interview and earn more. Learn more
A leading technology company seeks professionals fluent in Arabic to enhance and evaluate multilingual datasets for large language models. Responsibilities include rubric design, prompt evaluation, and documenting cultural nuances in model performance. Candidates should have a Master's degree and experience in LLM evaluation or linguistic QA. Attention to detail and cultural familiarity are essential for success in this role.
We are looking for linguistically and culturally aware professionals to support the evaluation and enhancement of multilingual prompt-response datasets for large language models (LLMs). This role involves rubric design, evaluation of translations and model outputs, prompt creation, and red teaming focused on identifying and surfacing cultural nuances and biases in LLM behaviour.
Key Responsibilities :
Rubric Definition & Prompt Evaluation