Ativa os alertas de emprego por e-mail!

Content Editor/Author/Writer

Innodata Inc.

Aracaju

Teletrabalho

BRL 80.000 - 120.000

Tempo integral

Há 13 dias

Resumo da oferta

A leading technology firm in Brazil is seeking experienced professionals in AI red teaming and quality assurance. The role involves conducting rigorous tests on AI-generated content, evaluating vulnerabilities, and collaborating with data scientists. Ideal candidates must possess strong analytical skills and familiarity with LLM safety testing. A C1 or C2 level of English is required for qualification.

Qualificações

Proven experience in AI red teaming, LLM safety testing, or adversarial prompt design.
Familiarity with prompt engineering, NLP tasks, and ethical considerations in generative AI.
Strong background in Quality Assurance, content review, or test case development for AI/ML systems.
Understanding of LLM behaviors, failure modes, and model evaluation metrics.
Excellent critical thinking, pattern recognition, and analytical writing skills.

Responsabilidades

Conduct Red Teaming exercises to identify unsafe outputs from LLMs.
Evaluate AI prompts to uncover potential failure modes.
Develop test cases to assess accuracy and risks in AI-generated responses.
Collaborate with teams to report risks and suggest mitigations.
Perform manual QA and content validation across model versions.

Conhecimentos

AI red teaming

Prompt evaluation

Quality assurance

Analytical writing

Formação académica

Background in linguistics, psychology, or computational ethics

We are seeking highly analytical and detail-oriented professionals with hands-on experience in Red Teaming, Prompt Evaluation, and AI/LLM Quality Assurance.

The ideal candidate will help us rigorously test and evaluate AI-generated content to identify vulnerabilities, assess risks, and ensure compliance with safety, ethical, and quality standards.

Key Responsibilities

Conduct Red Teaming exercises to identify adversarial, harmful, or unsafe outputs from large language models (LLMs).
Evaluate and stress-test AI prompts across multiple domains (e.g., finance, healthcare, security) to uncover potential failure modes.
Develop and apply test cases to assess accuracy, bias, toxicity, hallucinations, and misuse potential in AI-generated responses.
Collaborate with data scientists, safety researchers, and prompt engineers to report risks and suggest mitigations.
Perform manual QA and content validation across model versions, ensuring factual consistency, coherence, and guideline adherence.
Create evaluation frameworks and scoring rubrics for prompt performance and safety compliance.
Document findings, edge cases, and vulnerability reports with high clarity and structure.

Requirements

Proven experience in AI red teaming, LLM safety testing, or adversarial prompt design.
Familiarity with prompt engineering, NLP tasks, and ethical considerations in generative AI.
Strong background in Quality Assurance, content review, or test case development for AI/ML systems.
Understanding of LLM behaviors, failure modes, and model evaluation metrics.
Excellent critical thinking, pattern recognition, and analytical writing skills.
Ability to work independently, follow detailed evaluation protocols, and meet tight deadlines.

Preferred Qualifications

Prior work with teams like OpenAI, Anthropic, Google DeepMind, or other LLM safety initiatives.
Experience in risk assessment, red team security testing, or AI policy & governance.
Background in linguistics, psychology, or computational ethics is a plus.

Next Steps

To proceed further in the evaluation process, you will need to complete two assessments: Assessment Test Evaluates your linguistic and analytical skills. Link: Versant English Proficiency Test focuses on assessing your spoken and written English proficiency. A C1 or C2 level is required to qualify.

Once both assessments are successfully completed, you will be eligible for onboarding.

Language Test

Action Required: XConnect Registration. You will also receive an invitation to our internal job platform, XConnect. Please take a few minutes to register and complete your profile. All project onboarding, communication, and documentation are managed through this platform.

Obtém a tua avaliação gratuita e confidencial do currículo.

ou arrasta um ficheiro em formato PDF, DOC, DOCX, ODT ou PAGES até 5 MB.