Red Teaming Prompt Writer – Cultural Awareness

Innodata Inc.

Frankfurt

On-site

EUR 40,000 - 60,000

Full-time

Posted 3 days ago

Summary

A global data engineering company seeks an AI Red Teaming & Prompt Evaluation Specialist in Frankfurt, Germany. The role involves conducting Red Teaming exercises, evaluating AI prompts, and collaborating with experts to ensure safety and compliance in AI outputs. Ideal candidates have experience in AI red teaming and strong analytical skills. The hourly commitment is flexible, at 5-6 hours.

Qualifications

  • Proven experience in AI red teaming or adversarial prompt design.
  • Familiarity with ethical considerations in generative AI.
  • Understanding of LLM behaviours and failure modes.

Responsibilities

  • Conduct Red Teaming exercises to identify harmful outputs.
  • Evaluate and stress-test AI prompts to uncover failure modes.
  • Collaborate with researchers to report risks and suggest mitigations.

Skills

AI red teaming
Prompt engineering
NLP tasks
Analytical writing

Job Description

Overview

Innodata (NASDAQ: INOD) is a leading data engineering company. With more than 2,000 customers and operations in 13 cities around the world, we are an AI technology solutions provider-of-choice for 4 out of 5 of the world’s biggest technology companies, as well as leading companies across financial services, insurance, technology, law, and medicine.

By combining advanced machine learning and artificial intelligence (ML / AI) technologies, a global workforce of subject matter experts, and a high-security infrastructure, we’re helping usher in the promise of AI. Innodata offers a powerful combination of both digital data solutions and easy-to-use, high-quality platforms.

Our global workforce includes over 5,000 employees in the United States, Canada, the United Kingdom, the Philippines, India, Sri Lanka, Israel, and Germany.

Job Title

AI Red Teaming & Prompt Evaluation Specialist

Hourly Commitment

5-6 hours

Key Responsibilities
  • Conduct Red Teaming exercises to identify adversarial, harmful, or unsafe outputs from large language models (LLMs).
  • Evaluate and stress-test AI prompts across multiple domains (e.g., finance, healthcare, security) to uncover potential failure modes.
  • Develop and apply test cases to assess accuracy, bias, toxicity, hallucinations, and misuse potential in AI-generated responses.
  • Collaborate with data scientists, safety researchers, and prompt engineers to report risks and suggest mitigations.
  • Perform manual QA and content validation across model versions, ensuring factual consistency, coherence, and guideline adherence.
  • Create evaluation frameworks and scoring rubrics for prompt performance and safety compliance.
  • Document findings, edge cases, and vulnerability reports with high clarity and structure.

Requirements
  • Proven experience in AI red teaming, LLM safety testing, or adversarial prompt design.
  • Familiarity with prompt engineering, NLP tasks, and ethical considerations in generative AI.
  • Strong background in Quality Assurance, content review, or test case development for AI / ML systems.
  • Understanding of LLM behaviours, failure modes, and model evaluation metrics.
  • Excellent critical thinking, pattern recognition, and analytical writing skills.
  • Ability to work independently, follow detailed evaluation protocols, and meet tight deadlines.