About Siyada Tech
Siyada Tech is a Saudi technology company at the forefront of AI innovation, agentic AI, and digital transformation. We help organizations embrace the future of intelligent systems and scalable digital solutions.
Title
RAG Data Engineer (The RAG side can be learned)
(Guardian of Context. Slayer of Hallucinations.)
Mission
First be a team player, and Curate, structure, and deliver high-integrity data to our Retrieval-Augmented Generation systems so they stop inventing fairy tales and instead become the most accurate AI in the Kingdom.
Responsibilities
- Design and maintain ingestion pipelines for documents, DBs, CRM logs, PDFs, policies, emails, and whatever else management throws your way
- Transform unstructured horrors into clean embeddings-ready data (chunking, metadata tagging, semantic structure)
- Build automated monitoring for drift, stale knowledge, broken links, and hallucination triggers
- Manage vector database lifecycle: indexing strategies, deduplication
- Ensure iron-clad data lineage from source to query output
- Collaborate with AI engineers to tune retrieval performance (recall, precision, ranking)
- Maintain a central knowledge governance system: categorization, versioning, access controls
- Run periodic quality audits so the model doesn’t accidentally cite a 2014 blog post as law
- Document processes with a clarity unheard of in software teams
Skills & Tools
- LLM-friendly preprocessing: chunking logic, semantic splitting, OCR, annotation tools
- Metadata, schema design, data modeling for knowledge retrieval
- API integration and webhook orchestration
- Monitoring and observability for both data quality and AI performance
- Bonus points: hands‑on with LangChain/LlamaIndex or custom RAG architectures
Mindset
- Gets irrationally angry at duplicated documents
- Knows that “data quantity” is not “data quality”
- Believes hallucination is a crime against humanity (and product demos)
- Works proactively, with an almost religious dedication to truth
KPIs
- Latency drops while context accuracy rises
- % of content with fresh metadata increased
- Meaningfully fewer “Wait, did the AI just make that up?” moments
Why SiyadaTech
- You become the unseen architect of truth inside an AI powerhouse. The one who ensures our smartest systems speak facts, not fiction.