disclosure-bureau

Author	SHA1	Message	Date
Luiz Gustavo	eaf282c535	W2: rerank opt-in, analyze_image_region tool, RAG eval, graph cleanup, ADRs - TD#8 hybrid.ts: rerank_strategy {always|when_top_k_gt|never} + threshold (default skips rerank for top_k ≤ 15; chat tool uses threshold 10) - O11 vision.ts + tools.ts: analyze_image_region tool: sharp-crops the bbox, claude CLI reads the temp PNG via Read tool, Sonnet vision answers - TD#12 /graph: SigmaGraph replaces ForceGraphCanvas; react-force-graph-2d uninstalled (-37 transitive deps); force-graph-canvas.tsx deleted - TD#27 messages/route.ts gatherContext slice sizes via CTX_* env vars - TD#22 tests/rag/: golden.yaml (15 queries) + run.py (Recall@k + MRR + negative-pass rate) + baseline.json + CI job in .forgejo/workflows/ci.yml - docs/adrs/: ADR-001..005 published from systems-atelier deliverables. Verified live on disclosure.top: top_k=5 path skips rerank (6.7s embed-only, was 12-15s with rerank); rerank=always still available on demand. First RAG baseline: Recall@5 = 0.2083, MRR = 0.25, Negative pass = 1.0. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-23 19:20:09 -03:00
Luiz Gustavo	504b20fa5c	search: gate dense recall by cosine-distance threshold in the RPC. Root-cause fix for "search returns garbage for absent terms". The hybrid RPC's dense branch always returned its k nearest vectors regardless of distance, so a query for a term not in the corpus (e.g. "varginha") surfaced unrelated chunks. The cross-encoder reranker would filter these but costs 18-62s on CPU, which is unusable for interactive search. Add max_dense_dist (default 0.40) to hybrid_search_chunks: dense neighbours beyond that cosine distance are dropped server-side. Calibrated from measured distances: strong semantic match ~0.12-0.20, no real match ~0.46-0.53. BM25 full-text still matches literal terms; the reranker becomes opt-in refinement. Verified live: varginha/abducao → 0, disco voador/roswell → relevant, all <1s. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-21 16:36:56 -03:00
Luiz Gustavo	4865f974b6	fix search: rerank-gate results so absent terms return nothing. The hybrid_search RPC always returns up to recall_k dense neighbours, so a query for a term absent from the corpus (e.g. "varginha") returned its 12 nearest vectors, i.e. irrelevant chunks like PAGE_NUMBER "1". Two bugs: the reranker was skipped whenever results <= top_k, and there was no relevance floor. Now always run the cross-encoder reranker (BGE-reranker-v2-m3, normalized sigmoid) and drop hits below 0.02. Verified: "varginha" → 0 results; "roswell"/"tic tac"/"disco voador" → relevant hits on top (reranker cleanly separates 0.0001 garbage from 0.03-0.27 matches). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-21 14:46:49 -03:00
guto	19d0678e55	baseline: Disclosure Bureau pipeline + Next.js UI + Supabase stack	2026-05-17 22:44:36 -03:00
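
The three search commits above describe a sequence of retrieval gates: a relevance floor on reranker scores (4865f974b6), a cosine-distance cutoff on dense neighbours (504b20fa5c), and an opt-in rerank strategy (eaf282c535). The following is a minimal Python sketch of that gating logic, not the repository's code: the actual implementation lives in the SQL RPC and in hybrid.ts. The `Hit` type and all function names are hypothetical; only the numeric defaults (0.40, 0.02, 15) and the strategy names come from the commit messages. Treating `when_top_k_gt` as the default strategy is an assumption inferred from "default skips rerank for top_k ≤ 15".

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Hit:
    """Hypothetical search hit; field names are illustrative only."""
    chunk_id: str
    cosine_distance: float


def gate_dense_hits(hits, max_dense_dist=0.40):
    """Drop dense neighbours whose cosine distance exceeds the cutoff.

    Mirrors the idea of max_dense_dist (default 0.40) from commit 504b20fa5c.
    """
    return [h for h in hits if h.cosine_distance <= max_dense_dist]


def should_rerank(strategy, top_k, threshold=15):
    """Decide whether to run the cross-encoder reranker.

    Strategy names come from commit eaf282c535; the default threshold of 15
    matches "default skips rerank for top_k <= 15".
    """
    if strategy == "always":
        return True
    if strategy == "never":
        return False
    if strategy == "when_top_k_gt":
        return top_k > threshold
    raise ValueError(f"unknown rerank strategy: {strategy!r}")


def apply_relevance_floor(scored_hits, floor=0.02):
    """Keep (hit, score) pairs whose reranker score is at least the floor.

    Scores are assumed to be already normalized to [0, 1] (e.g. by a sigmoid),
    as described in commit 4865f974b6.
    """
    return [(hit, score) for hit, score in scored_hits if score >= floor]
```

With these defaults, a `top_k=5` query skips the reranker, while the chat tool's threshold of 10 would trigger it for `top_k=12`:

```python
hits = [Hit("a", 0.15), Hit("b", 0.50)]
kept = gate_dense_hits(hits)                               # only "a" survives
skip = should_rerank("when_top_k_gt", top_k=5)             # False
chat = should_rerank("when_top_k_gt", top_k=12, threshold=10)  # True
```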

4 commits
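
Commit eaf282c535 reports a RAG baseline in terms of Recall@5, MRR, and a negative-pass rate. As a rough illustration of how such metrics are commonly computed, here is a small Python sketch. It is not the repository's run.py: the case layout (`retrieved` / `relevant` lists) and the function names are hypothetical, and the definition of "negative pass" (a query with no expected results passes when nothing is returned) is an assumption based on the absent-term behaviour described in the earlier commits.

```python
def recall_at_k(retrieved, relevant, k=5):
    """Fraction of relevant ids that appear in the top-k retrieved ids."""
    relevant_set = set(relevant)
    if not relevant_set:
        return 0.0
    return len(set(retrieved[:k]) & relevant_set) / len(relevant_set)


def reciprocal_rank(retrieved, relevant):
    """1 / rank of the first relevant id (1-based), or 0.0 if none is found."""
    relevant_set = set(relevant)
    for rank, chunk_id in enumerate(retrieved, start=1):
        if chunk_id in relevant_set:
            return 1.0 / rank
    return 0.0


def evaluate(cases, k=5):
    """Aggregate metrics over a golden set.

    Each case is a dict with 'retrieved' (ranked ids) and 'relevant'
    (expected ids). Cases with an empty 'relevant' list are treated as
    negative queries (assumption): they pass when nothing is retrieved.
    """
    positives = [c for c in cases if c["relevant"]]
    negatives = [c for c in cases if not c["relevant"]]

    def mean(values):
        values = list(values)
        return sum(values) / len(values) if values else 0.0

    return {
        "recall_at_k": mean(recall_at_k(c["retrieved"], c["relevant"], k) for c in positives),
        "mrr": mean(reciprocal_rank(c["retrieved"], c["relevant"]) for c in positives),
        "negative_pass": mean(1.0 if not c["retrieved"] else 0.0 for c in negatives),
    }
```

A toy golden set (invented ids, not data from the repository) shows the shape of the result:

```python
cases = [
    {"retrieved": ["c1", "c2", "c3"], "relevant": ["c2", "c9"]},
    {"retrieved": ["x"], "relevant": ["y"]},
    {"retrieved": [], "relevant": []},  # negative query: expects no hits
]
metrics = evaluate(cases, k=5)
```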