
Attiva gli avvisi di lavoro via e-mail!
A leading technology consultancy is seeking a Senior RAG Systems Developer to create a prototype for legal document reasoning. This role is a contract/freelance position that pays $600 upon successful completion of a technical evaluation. The ideal candidate will have expertise in graph-based retrieval and Python 3.12, focusing on delivering high accuracy and explainability. Successful performance may lead to a long-term opportunity.
GraphRAG Developer Challenge – Legal Document Processing (Prototype)
We’re seeking an expert in graph-based retrieval (GraphRAG) to build a high-accuracy prototype for legal document reasoning. This is a paid technical test that may lead to a long-term position. The goal is a true GraphRAG system featuring explicit knowledge-graph construction and traversal, multi-hop reasoning, agentic orchestration, and strong focus on retrieval accuracy and explainability.
Download materials :
(Benchmark uses unseen questions.)
Implement two functions in Python 3.12 (Poetry project):
No UI, no API keys provided. Any stack may be used. query(...) must support parallel execution (~400 questions in ≤60 min) and show a progress indicator. Test thoroughly for correctness and performance before the demo.
In a 60-minute live session you will :
Only the developer(s) who wrote the code may present.
Passing requires an overall score above 95%, measured by (LLM as a judge): Faithfulness (grounded, no hallucinations), Relevance (retrieval matches intent), Completeness (covers key legal points), and Clarity (structured, legally coherent writing).
If you pass, you’ll receive $600 USD after verification of reproducibility and hand-over of the repo (codebase, Poetry lock, run instructions, brief tech note). Top performers may be invited to interview for a long-term paid role. Failure to pass or complete within 60 minutes = no payment (you keep your code).
Parallelization, graph-based reasoning, correctness, and explainability.
We will not engage in pre-contract discussions with agencies. If an agency wishes to propose a developer, communication will proceed only after that developer passes the benchmark test. This ensures time efficiency and direct technical validation.