W2: rerank opt-in, analyze_image_region tool, RAG eval, graph cleanup, ADRs

- TD#8 hybrid.ts: rerank_strategy {always|when_top_k_gt|never} + threshold (default skips rerank for top_k ≤ 15; chat tool uses threshold 10) - O11 vision.ts + tools.ts: analyze_image_region tool — sharp-crops the bbox, claude CLI reads the temp PNG via Read tool, Sonnet vision answers - TD#12 /graph: SigmaGraph replaces ForceGraphCanvas; react-force-graph-2d uninstalled (-37 transitive deps); force-graph-canvas.tsx deleted - TD#27 messages/route.ts gatherContext slice sizes via CTX_* env vars - TD#22 tests/rag/: golden.yaml (15 queries) + run.py (Recall@k + MRR + negative-pass rate) + baseline.json + CI job in .forgejo/workflows/ci.yml - docs/adrs/: ADR-001..005 published from systems-atelier deliverables Verified live on disclosure.top: top_k=5 path skips rerank (6.7s embed-only, was 12-15s with rerank); rerank=always still available on demand. First RAG baseline: Recall@5 = 0.2083, MRR = 0.25, Negative pass = 1.0. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-05-23 19:20:09 -03:00 · 2026-05-23 19:20:09 -03:00 · eaf282c535
commit eaf282c535
parent 55cac8a395
20 changed files with 1246 additions and 1025 deletions
--- a/.forgejo/workflows/ci.yml
+++ b/.forgejo/workflows/ci.yml
@ -68,3 +68,16 @@ jobs:
    steps:
      - uses: actions/checkout@v4
      - run: npm audit --production --omit=dev --audit-level=high || echo "audit findings — see job output"
  rag-eval:
    name: Retrieval — golden set (Recall@5 + MRR)
    runs-on: ubuntu-latest
    container:
      image: python:3.11-bookworm
    steps:
      - uses: actions/checkout@v4
      - run: pip install --quiet pyyaml
      - name: Run RAG eval against production
        run: python3 tests/rag/run.py --url https://disclosure.top --top-k 5 --rerank never
        env:
          MAX_RECALL_DROP: "0.05"
--- a/CHANGELOG.md
+++ b/CHANGELOG.md
@ -4,6 +4,100 @@ All notable changes to this project go here. Newest on top.
 ## [Unreleased]
 ### W2 — UX latency + retrieval eval + vision tool
 *2026-05-23 · systems-atelier engagement trace `794f00ba`*
 - **TD#8 · Reranker opt-in** (`hybrid.ts`). New `rerank_strategy` field
  on `HybridSearchOptions`: `"always" | "when_top_k_gt" | "never"`, with
  a configurable `rerank_threshold` (default 15). Default strategy is
  `when_top_k_gt` so the slow cross-encoder only runs when the model
  asks for a wider list; top-K ≤ 15 trusts the RPC's RRF order. The
  chat tool calls hybrid_search with threshold 10 so a 10-hit response
  costs ~7s of embed+RPC instead of 12-15s with rerank. `/api/search/hybrid`
  exposes the strategy via `?rerank=always|never|when_top_k_gt` plus
  `?rerank_threshold=N`. Back-compat `?rerank=0` still means "never".
 - **O11 · `analyze_image_region` chat tool** (`vision.ts`, `tools.ts`).
  New OpenAI-style function tool that crops a normalized bbox of a page
  PNG with sharp, writes it to a temp file, and asks Claude Code OAuth
  (Sonnet) to Read the local file and answer a question about it.
  Schema: `{doc_id, page, bbox{x,y,w,h}, question, context?}`. Emits a
  `crop_image` artifact for the UI alongside the textual answer. Cost
  budget: ~$0.005–0.02 per call, paid against the user's Max 20x
  quota. Timeout configurable via `VISION_TIMEOUT_MS` (default 120s).
 - **TD#12 · `react-force-graph-2d` removed**. The `/graph` page now uses
  `<SigmaGraph>` (already wired for the entity sidebar). One graph
  library is enough. `web/components/force-graph-canvas.tsx` deleted;
  `npm uninstall` removed 37 transitive deps.
 - **TD#27 · Context truncation per type configurable**
  (`messages/route.ts`). The four `gatherContext` slice limits are now
  driven by env (`CTX_DOC_FRONTMATTER`, `CTX_DOC_BODY`,
  `CTX_PAGE_FRONTMATTER`, `CTX_PAGE_BODY`) with sensible production
  defaults (was hard-coded 1200/1500/1500/1500).
 - **TD#22 · Golden RAG eval** (`tests/rag/`). New harness:
  `golden.yaml` carries 15 curated queries (some calibrated to the
  current top-1 hit on prod, some negative-set sentinels like
  `MJ-12` / `tic-tac` that should NOT return matches), `run.py`
  measures `Recall@k` + `MRR` + `negative_pass_rate` against any
  deployment URL, `baseline.json` is the gate threshold, `last_run.json`
  is the working report. Default behaviour: fail the run when Recall@5
  drops > 0.05 from baseline. CI workflow runs against
  `https://disclosure.top` on every push.
  - First baseline (rerank=never): **Recall@5 = 0.2083, MRR = 0.25,
    Negative pass = 1.0**. Golden set still needs curation —
    intentionally conservative now so drift detection is meaningful.
 - **ADRs published to `docs/adrs/`** — ADR-001 (embedding + rerank stack),
  ADR-002 (Investigation Bureau runtime — Bun + LISTEN/NOTIFY + 8 security
  gates, to be implemented in W3), ADR-003 (LLM routing policy), ADR-004
  (auth + RLS evolution), ADR-005 (self-hosted by default).
 #### Verified on `disclosure.top` (2026-05-23T21:55Z):
 - `/api/search/hybrid?q=Roswell&top_k=5` → HTTP 200 in 6.7s (embed-only,
  rerank skipped per default strategy)
 - `/api/search/hybrid?q=Roswell&top_k=20&rerank=always` → confirmed slow
  (>30s, hits cross-encoder)
 - Typecheck `web/` clean; `react-force-graph-2d` no longer in
  `package.json`
 - `tests/rag/run.py` against prod → 15 queries answered, baseline written
 - 5 ADRs committed under `docs/adrs/`
 ### W1.2 — Glitchtip + Forgejo self-hosted
 *2026-05-23 · systems-atelier engagement trace `794f00ba`*
 - **Glitchtip self-host** (Sentry-compatible error monitor). New services
  in compose: `glitchtip-redis`, `glitchtip-web`, `glitchtip-worker`
  (v4.2, uWSGI on 8080). Database `glitchtip` carved out of
  `disclosure-db` as a separate role/DB. Bootstrap done via Django
  `manage.py shell` — admin user, organization `the-disclosure-bureau`,
  project `web`, DSN issued. SDK wired: `@sentry/nextjs` + `instrumentation.ts`
  + `sentry.{client,server}.config.ts`. `/api/admin/throw` smoke endpoint
  is admin-gated. Live at `https://glitchtip.disclosure.top` (TLS issued
  by Let's Encrypt via Traefik). Synthetic event verified — POST
  `/api/1/store/` → 200 + event id.
 - **Forgejo self-host + Actions CI**. New services in compose: `forgejo`
  (v9, default branch `main`) and `forgejo-runner` (v6, registered to
  the host docker socket via `group_add: [988]`). Admin user
  `discadmin` created via `forgejo admin user create` (the literal
  `admin` is reserved). Runner bootstrap on first start: registers if
  `.runner` absent, then `forgejo-runner daemon`. Repo
  `discadmin/disclosure-bureau` created via API; this commit was the
  first push and triggered `W0+W1+W1.2: …` workflow at task 1.
 - **`.forgejo/workflows/ci.yml`** — three jobs: `web` (typecheck +
  lint + production build), `python` (compile scripts + validate
  compose YAML), `audit` (`npm audit --production`). Default container
  per job, all behind the `ubuntu-latest` label served by the
  self-hosted runner.
 #### Verified on the stack (2026-05-23T21:19Z):
 - `glitchtip.disclosure.top` → HTTP 200, real Let's Encrypt cert,
  Glitchtip CSP headers present.
 - POST `/api/1/store/` → 200, event_id `cb17d723…` returned.
 - `forgejo.disclosure.top` → HTTP 200, Forgejo welcome page.
 - Forgejo runner logs: `runner: disclosure-runner … declared
  successfully`, `[poller 0] launched`, `task 1 repo is
  discadmin/disclosure-bureau` (CI job picked up).
 - First Forgejo Actions workflow run: `status=running` on the commit
  pushed by this changelog.
 ### W1 — Observability + resilience + Meili autocomplete
 *2026-05-23 · systems-atelier engagement trace `794f00ba`*
--- a/docs/adrs/ADR-001-embedding-rerank-stack.md
+++ b/docs/adrs/ADR-001-embedding-rerank-stack.md
@ -0,0 +1,56 @@
 ---
 adr: ADR-001
 title: Embedding e reranker stack — manter BGE-M3 self-hosted CPU; tornar reranker opt-in; reavaliar GPU em 6 meses
 status: accepted
 date: 2026-05-23
 deciders: sa-principal, sa-architecture-lead, sa-platform-lead
 project: disclosure-bureau
 ---
 ## Context
 A retrieval pipeline atual (BGE-M3 dense + BM25 + RRF + BGE-Reranker-v2-M3 cross-encoder) entrega `Recall@5` aceitavel para 20.935 chunks, mas o reranker (cross-encoder em CPU) consome **5-8s** por consulta com 100 candidatos. Esse e o **gargalo dominante de UX** no chat sincrono.
 Alternativas conhecidas:
 1. **Manter status quo**: CPU rerank sempre on.
 2. **Skip rerank**: confiar so em RRF do RPC `hybrid_search_chunks`. Mais rapido, perde precisao em queries ambiguas.
 3. **Switch para ColBERT-late-interaction** (PyTerrier / RAGatouille) — rerank built-in no recall, sem segundo modelo.
 4. **GPU para reranker**: VPS com GPU pequena (Hetzner GPU-1: ~$20/mes). BGE-Reranker em GPU caem de 5-8s para <1s.
 5. **External managed (Voyage, Cohere)**: viola politica "self-hosted by default".
 ## Decision
 1. **Manter BGE-M3 self-hosted CPU para embeddings** (sem mudanca; 150-300ms warm e ok).
 2. **Tornar reranker opt-in por chamada**:
   - Default: skip rerank quando `top_k <= 10` (RPC RRF e suficiente para top resultados).
   - Aplicar rerank quando `top_k > 10` OU explicit `rerank=1` no API.
 3. **Avaliar GPU em 6 meses** (Q4 2026) com criterio: se rerank latencia p95 > 4s ou usuario base > 1000 DAU. Se ambos, provisionar GPU 1.
 4. **ColBERT como plano B**: catalogar em `infra/research/` mas nao trocar agora (risco de regressao de qualidade).
 5. **Continuar BGE-M3 multi-lingua**: nao trocar para modelo english-only mesmo que mais rapido — corpus e bilingue.
 ## Consequences
 **Positivas:**
 - Latencia mediana p95 do chat cai de ~10s para ~6s (estimativa baseada em remocao do rerank para top_k <=10).
 - Custo continua $0/mes alem do VPS (sem GPU upgrade).
 - Reranker continua disponivel para queries complexas (`top_k=20+`).
 **Negativas:**
 - Quando user pede explicitamente "top 20 results", latencia volta a ser 8s.
 - Recall@5 pode cair marginalmente em queries muito ambiguas. Ver eval harness W2.
 **Trade-off aceito:** UX media melhora; UX pior caso mantem. Eval harness do W2 vai pegar regressao real.
 ## Verification
 - `tests/rag/golden.yaml` mede Recall@5 antes/depois.
 - Sentry timing histogram `chat_query_latency_ms` p95 antes/depois.
 - Manual smoke test: 5 queries cobrindo cada `top_k` bucket.
 ## References
 - `infra/RETRIEVAL.md` (performance budget).
 - `web/lib/retrieval/hybrid.ts` (codigo).
 - BGE-M3 paper: arxiv:2402.03216.
 - ColBERT-late-interaction: arxiv:2004.12832.
--- a/docs/adrs/ADR-002-investigation-bureau-runtime.md
+++ b/docs/adrs/ADR-002-investigation-bureau-runtime.md
@ -0,0 +1,77 @@
 ---
 adr: ADR-002
 title: Materializar Investigation Bureau — runtime agentico em background, 8 detetives como roles
 status: accepted
 date: 2026-05-23
 deciders: sa-principal, sa-architecture-lead, sa-security-engineer (veto power)
 project: disclosure-bureau
 ---
 ## Context
 O branding "The Disclosure Bureau" promete "8 detetives investigativos" (Holmes/Poirot/Dupin/Locard/Schneier/Tetlock/Taleb + Investigation Bureau coletivo) com chain of custody, hypothesis tournament, residual uncertainty calculation. Hoje, o codebase tem:
 - `case/` filesystem com 6 pastas — 5 vazias, 1 com 2 gap files.
 - Chat com 12 tools read-only e um system prompt grandioso.
 - AG-UI artifact types `evidence_card`, `hypothesis_card`, `case_card` definidos mas **nao emitidos**.
 - Zero detetives implementados como entidades operacionais distintas.
 O brief pede: "AI detective bureau REAL, nao decorativo". Isso requer **producao** de dado novo (`case/evidence/*.md`, `case/hypotheses/*.md`, `public.{hypotheses,evidence,contradictions,...}`) por **agentes especializados** com **outputs estruturados e auditaveis**.
 Decisao de fronteira: a camada agentica vive **em paralelo** ao chat sincrono ou e **parte dele**?
 ## Options considered
 1. **Parte do chat sincrono.** Estender system prompt + adicionar write tools. Usuario espera 30s-5min sincrono.
 2. **Worker em background.** Chat dispara job; usuario polls; worker assincrono produz outputs.
 3. **Sem agentic layer**: manter so chat read-only. Refatorar branding para refletir realidade ("AI-assisted wiki").
 4. **CronJob batch only**. Sem trigger user. Investigacoes acontecem em background diario.
 ## Decision
 **Opcao 2: Worker em background, separado do chat sincrono.**
 Especificamente:
 1. **Novo container `investigator-runtime`** (Bun + TS) no docker-compose, isolado de Next.js.
 2. **8 detetives + chief-detective como roles** distintos: cada um e um `claude -p` subprocess com `prompts/<detective>.md` proprio e toolset distinto (subset de tools comuns + 1-2 writers especificos).
 3. **Postgres LISTEN/NOTIFY** como queue (`public.investigation_jobs` + trigger NOTIFY).
 4. **Triggers de job** (sec 6 do agentic-layer-spec): cron diario, evento ingest, user via chat (`request_investigation` tool), admin manual.
 5. **Tools de write gated** (8 gates do sa-security-engineer; ver `security-audit-report.md` secao 5).
 6. **Budget cap por job:** $1.00 hard ceiling (Sonnet via OAuth Max 20x preferido; Anthropic API paid como fallback).
 7. **Outputs validados antes de commit:** schema check + lint (`04-lint.py --dry-run`) sobre markdown gerado.
 **Nao adotamos:**
 - Opcao 1 (estender chat sincrono): user nao pode esperar 5 min num chat. Quebra modelo mental.
 - Opcao 3 (sem agentic): foge do brief explicito. Branding sem motor e desonesto.
 - Opcao 4 (cron only): sem trigger user e UX pobre. Manter cron como complementar, nao exclusivo.
 ## Consequences
 **Positivas:**
 - Branding "8 detetives" passa a ter motor real.
 - Chat sincrono continua rapido (LLM read-only + 12 tools).
 - Investigacoes profundas geram dado novo, persistente, auditavel — Investigation Bureau "de verdade".
 - Cold-case revival, contradiction detection, residual uncertainty — features que viralizam.
 **Negativas:**
 - Novo container = nova superficie operacional (~150MB RAM extra; orchestrator + state).
 - Quota Claude Max 20x mais utilizada (ja monitorada por `/api/admin/batch`).
 - Schema cresce: 7 novas tabelas (hypotheses, evidence, contradictions, witnesses, gaps, residual_uncertainties, investigation_jobs).
 - Risco de hallucination em writers — mitigado por gates sa-security (validacao schema + ref).
 ## Verification
 - Spec completa em `agentic-layer-spec.md`.
 - Plano de bring-up incremental em 10 sub-steps W3.1-W3.10.
 - 8 gates documentados para sa-security veto.
 - Custos esperados $30-110/mes (tabela secao 11 do spec).
 - Golden hypothesis set como quality bar (W3.10).
 ## References
 - `agentic-layer-spec.md`
 - `ai-opportunity-map.md` O1-O5
 - `security-audit-report.md` secao 5
 - Anthropic Claude Code OAuth pattern (memoria do projeto)
--- a/docs/adrs/ADR-003-llm-routing-policy.md
+++ b/docs/adrs/ADR-003-llm-routing-policy.md
@ -0,0 +1,72 @@
 ---
 adr: ADR-003
 title: LLM routing policy — Claude Sonnet 4.6 via OAuth para producao asincrona; OpenRouter free para chat publico
 status: accepted
 date: 2026-05-23
 deciders: sa-principal, sa-platform-lead
 project: disclosure-bureau
 ---
 ## Context
 Tres caminhos de LLM no projeto:
 1. **Vision pipeline (ingest)**: Sonnet 4.6 via Anthropic SDK + prompt caching + `pdf-2025-03-04` beta. Custo unico ~$409 inicial.
 2. **Chat sincrono (user-facing)**: hoje OpenRouter free (`deepseek/deepseek-v4-flash:free` primario, `nvidia/nemotron-3-super-120b-a12b:free` fallback). Tool calling funciona.
 3. **Investigation Bureau (W3+ a implementar)**: propostas: Sonnet 4.6 via OAuth Max 20x.
 Restricoes existentes:
 - **Politica banida Gemini** ([memoria do projeto](file:///Users/guto/.claude/projects/-Users-guto-ufo/memory/MEMORY.md)). Cobranca de ~$200 vs $10 esperado.
 - **OAuth Max 20x quota**: 5h rolling window, default 4 workers ([memoria](file:///Users/guto/.claude/projects/-Users-guto-ufo/memory/MEMORY.md)).
 - **Self-hosted by default**: managed proibido sem excecao escrita (ADR-005).
 ## Decision
 **Roteamento por canal e por carga:**
 | Canal | Provider | Modelo | Razao |
 |---|---|---|---|
 | Vision pipeline (background) | Anthropic SDK direto | Sonnet 4.6 | API key valid; cache + beta header; nao usa quota OAuth |
 | Chat sincrono publico | OpenRouter | deepseek-v4-flash:free, nemotron fallback | Free tier; tool calling; usuario anonimo |
 | Chat sincrono autenticado (futuro premium) | OpenRouter ou Anthropic API direta | configurable | Tier paid quando justificado |
 | Investigation Bureau (W3+) | **Claude Code OAuth (subprocess `claude -p`)** | Sonnet 4.6 (model: sonnet) | Quota Max 20x; budget cap por job $1.00; preferido sobre paid API |
 | Investigation Bureau — overflow | Anthropic SDK paid | Sonnet 4.6 ou Haiku | Quando OAuth quota saturada AND `BUDGET_PAID_ALLOWED=true` |
 | LLM judge interno (calibration / contradiction detection) | Claude OAuth ou OpenRouter | Haiku (cheap, fast) | Tarefa simples, batch |
 **Politica de fallback:**
 1. Primary tenta. Se 429/quota -> 1 retry com backoff.
 2. Apos retry falhar: fallback policy:
   - Chat sincrono: troca OpenRouter primary -> OpenRouter fallback. Se ambos falham, retorna erro UX.
   - Vision/investigator: aborta job, registra em `investigation_jobs.status='failed'`. Aguarda quota reset (5h).
 3. `/api/admin/batch` ja monitora 429 + ETA quota reset.
 **Excecoes:**
 - Gemini **banido** (politica). Nao reativar mesmo se nova versao for atrativa.
 - Anthropic API key paid SO em variavel de ambiente separada (`ANTHROPIC_API_KEY_PAID`) — exige `--paid` flag explicito.
 ## Consequences
 **Positivas:**
 - Investigation Bureau pode operar 99% do tempo em quota OAuth (gratuita para o projeto).
 - Chat sincrono publico continua $0/req.
 - Separacao clara entre "sob quota" e "paid" — facil monitorar gasto.
 **Negativas:**
 - OpenRouter free-tier tem rate limits + latencia variavel. Mitigacao em W1 (retry + circuit breaker).
 - Quota saturation no Sonnet OAuth quando muitos workers ingestam + investigador roda em paralelo. Cron diario investigador as 03-05 UTC quando ingest e baixa.
 ## Verification
 - Logs Sentry mostram `model_used` em cada chat call.
 - `/api/admin/batch` mostra `quota_state` + `quota_resume_eta_minutes`.
 - `investigation_jobs.outputs` registra `model` para cada turno.
 - Budget alert em $150/mes Anthropic API se cair em paid fallback.
 ## References
 - `feedback-no-gemini-ever.md` (memoria)
 - `user-plan-max-20x.md` (memoria)
 - `web/lib/chat/{index,openrouter,claude-code}.ts`
--- a/docs/adrs/ADR-004-auth-and-rls-evolution.md
+++ b/docs/adrs/ADR-004-auth-and-rls-evolution.md
@ -0,0 +1,106 @@
 ---
 adr: ADR-004
 title: Auth model evolution — manter Supabase GoTrue + RLS publico-read; novas tabelas case/* write-only por investigator role
 status: accepted
 date: 2026-05-23
 deciders: sa-principal, sa-security-engineer, sa-platform-lead
 project: disclosure-bureau
 ---
 ## Context
 Modelo de auth atual:
 - **GoTrue email magic-link**, SMTP Spacemail, `MAILER_AUTOCONFIRM=false`.
 - **profiles.role ∈ {user, admin, suspended}**, com `budget_cap_usd`, `daily_quota`.
 - **Sessoes `is_public=true`** sao readable por anon (compartilhaveis via share_token UUID).
 - **RLS publico-read** em chunks/entities/documents/entity_mentions/relations.
 - **service_role key em env do container web** (necessario para usage_events INSERT bypass RLS + admin tasks). F5 do security audit.
 Decisao de fronteira na W3: novas tabelas (`hypotheses`, `evidence`, `contradictions`, `witnesses`, `gaps`, `residual_uncertainties`, `investigation_jobs`) — quem escreve?
 ## Options
 1. **service_role escreve tudo** (mesmo padrao atual). Investigator-runtime usa service_role.
 2. **Role intermediario `investigator`** com permissoes minimas. Investigator-runtime usa esse role; web nao.
 3. **Investigator usa Postgres direto sem RLS**: bypass desde o nivel de conexao.
 ## Decision
 **Opcao 2: Role `investigator` granular.**
 Criar role Postgres minimo:
 ```sql
 CREATE ROLE investigator LOGIN NOINHERIT PASSWORD :'INVESTIGATOR_PASSWORD';
 -- Reads (mesmo que anon ja tem, mas explicito)
 GRANT SELECT ON public.chunks, public.entities, public.entity_mentions,
                public.relations, public.documents TO investigator;
 -- Writes em tabelas case/*
 GRANT INSERT, UPDATE ON public.hypotheses, public.evidence, public.contradictions,
                       public.witnesses, public.gaps, public.residual_uncertainties,
                       public.investigation_jobs TO investigator;
 GRANT USAGE, SELECT ON ALL SEQUENCES IN SCHEMA public TO investigator;
 -- Negar tudo o resto
 REVOKE ALL ON public.profiles, public.chat_sessions, public.messages,
              public.usage_events FROM investigator;
 REVOKE ALL ON SCHEMA auth FROM investigator;
 ```
 E:
 - Investigator-runtime container usa `postgres://investigator:${INVESTIGATOR_PASSWORD}@db:5432/postgres` (NUNCA service_role).
 - Web service continua com `postgres://postgres:...` (acesso amplo necessario para createServiceClient nos pontos especificos).
 - Em W5, **avaliar reducao de uso de service_role no web** criando role `web_service` similar com permissoes ainda menores que `postgres`.
 **RLS nas novas tabelas:**
 ```sql
 ALTER TABLE public.hypotheses ENABLE ROW LEVEL SECURITY;
 CREATE POLICY hypotheses_public_read ON public.hypotheses FOR SELECT USING (TRUE);
 GRANT SELECT ON public.hypotheses TO anon, authenticated;
 -- Investigator role usa bypass via INSERT/UPDATE direto (sem RLS aplica em writes do owner role; ele tem GRANT)
 ```
 Mesmo padrao para todas as 7 novas tabelas.
 **Modificacoes nas existentes:**
 - `relations` ganha RLS (F4). Hoje sem.
 - `messages.citations JSONB` continua mas adicionar coluna `hypothesis_id REFERENCES public.hypotheses(hypothesis_id)` se chat citar hipoteses geradas.
 **Sessions publicas:**
 Decisao do F6 (moderation_state) fica fora deste ADR porque e produto/legal, nao auth. Mas RLS aceitara extensao:
 ```sql
 CREATE POLICY public_sessions_select ON public.chat_sessions
  FOR SELECT USING (
    auth.uid() = user_id OR
    (is_public = TRUE AND COALESCE(moderation_state, 'approved') IN ('approved'))
  );
 ```
 ## Consequences
 **Positivas:**
 - Blast radius do investigator-runtime menor (RCE no container nao acesso ao auth.users / profiles).
 - Auditabilidade granular: cada INSERT em hypotheses/evidence tem `created_by = '<detective>@detective'` + role Postgres `investigator`.
 - Aderencia a least-privilege.
 **Negativas:**
 - 1 password adicional para gerir (`INVESTIGATOR_PASSWORD`). Mitigacao: docker secrets em W5.
 - Investigator nao pode ler `messages` (intencional) — se algum dia detetive precisar contexto de sessao do user, precisa de hand-off explicito (ex: chat passar `userTurn` no `payload` do job).
 ## Verification
 - `\du investigator` confirma role + permissoes.
 - Teste manual: investigator role tenta `SELECT FROM auth.users` -> erro permission denied.
 - Teste manual: investigator role faz `INSERT INTO hypotheses` -> sucesso.
 - Service_role uses count auditavel: grep `createServiceClient` no `web/`.
 ## References
 - `security-audit-report.md` F4, F5, F6
 - `agentic-layer-spec.md` secao 3.2 (container env)
 - `infra/disclosure-stack/init-db.sql` (roles bootstrap atual)
--- a/docs/adrs/ADR-005-self-hosted-by-default.md
+++ b/docs/adrs/ADR-005-self-hosted-by-default.md
@ -0,0 +1,90 @@
 ---
 adr: ADR-005
 title: Self-hosted by default — managed SaaS exige excecao escrita; politica de excecoes vigentes
 status: accepted
 date: 2026-05-23
 deciders: sa-principal, sa-platform-lead
 project: disclosure-bureau
 ---
 ## Context
 O `systems-atelier` declara no manifest:
 > "Open-source e self-hosted em VPS por padrao. SaaS/managed exige excecao escrita e justificada."
 O Disclosure Bureau implementa essa politica em maioria mas mantem dependencias externas que precisam ser **explicitas** para nao virarem dette tecnica oculta:
 | Dependencia externa | Categoria | Justificativa |
 |---|---|---|
 | OpenRouter (chat sincrono) | LLM proxy | Tier free 0 USD; tool calling funciona; sem alternativa OSS local com mesma qualidade + multi-modelo |
 | Anthropic Claude (vision + investigator) | LLM | Sonnet 4.6 vision para PDF + agente investigador. Sem OSS equivalente em qualidade vision. |
 | Claude Code OAuth Max 20x | LLM (quota) | Quota gratis ja paga. Mesmo provider, canal alternativo. |
 | Spacemail (SMTP) | Email transactional | Magic-link envio. SES self-host overkill para volume baixo. |
 | Let's Encrypt (TLS) | PKI | Padrao da industria. CertResolver via Traefik. |
 | war.gov source PDFs | Data source | E o corpus em si — nao auto-substituivel. |
 | GitHub (deploy artifacts) | Code host | Pode migrar para Forgejo/Gitea self-host (Q3 2026 review). |
 | Hetzner / similar VPS | IaaS | Infra fisica; nao se evita. |
 E o que **NAO** entra (self-host adotado):
 - Postgres (Supabase) — self-host.
 - GoTrue, PostgREST, Realtime, Storage, Imgproxy, Studio, Kong — self-host.
 - BGE-M3 + Reranker — self-host (embed-service).
 - Meilisearch — self-host.
 - Traefik — self-host.
 - Sentry — **decisao W1**: avaliar Glitchtip self-host vs Sentry cloud free tier.
 ## Decision
 **Politica formal:**
 1. **Default: self-host.** Qualquer novo servico/dependencia comeca avaliando OSS self-hostable.
 2. **Excecao escrita** em ADR quando:
   - Sem alternativa OSS com qualidade aceitavel (criterio claro), OU
   - Custo de operar self-host > custo direto SaaS × 12 meses, OU
   - Restricao legal/compliance especifica.
 3. **Excecoes vigentes** listadas na tabela acima. Cada uma precisa ser:
   - Sem state critico do projeto (exemplo: dados podem ser exportados a qualquer momento; nao ha lock-in).
   - Substituivel em <4 semanas de trabalho (plano de saida documentado).
 4. **Excecoes proibidas:**
   - **Gemini** (politica especifica do projeto; ver `feedback-no-gemini-ever.md` memoria).
   - Banco de dados managed (RDS, Supabase Cloud paid) — corpus precisa estar 100% sob controle.
   - LLM gateway pagas alem de OpenRouter free tier sem ADR especifico.
   - CDN com state (Vercel KV, Cloudflare D1) — viola "data soberania".
 5. **Periodicamente:** revisar lista de excecoes (semestre). Revisar se equilibrio mudou (ex: Glitchtip amadureceu? Forgejo viavel?).
 ## Consequences
 **Positivas:**
 - Soberania de dados sobre corpus desclassificado (motivo central do projeto).
 - Custo recorrente baixo (~10 EUR/mes VPS + $0 OpenRouter free + $30-110/mes LLM agentic).
 - Sem dependencia de business decisions de fornecedor (Vercel mudar tier, Supabase Cloud aumentar preco).
 **Negativas:**
 - Mais operacao (10 containers no VPS, monitorados manualmente).
 - Atualizacoes de seguranca por nossa conta (Trivy em CI mitiga).
 - Backup/DR e nosso problema (W5+ adicionar backup strategy).
 ## Verification
 - `docker-compose.yml` lista todos os servicos do data plane self-host. Confirmado.
 - Lista de excecoes nesta ADR. Confirmar trimestralmente.
 - Plano de saida documentado para cada excecao:
  - OpenRouter -> Mistral.ai self-host (>= 70B local com GPU) em 2-4 semanas.
  - Anthropic -> Llama local (BAIXA qualidade hoje; 2027+).
  - Spacemail SMTP -> Postfix self-host em 1 dia.
  - GitHub -> Forgejo self-host em 1 semana.
 ## Future review triggers
 - Volume de chat > 10k/dia: avaliar movido para Mistral/Groq self-host.
 - Quota Anthropic Max 20x saturada constantemente: avaliar adicionar API key paid OU mover para local model.
 - Sentry cloud free tier estoura: instalar Glitchtip imediatamente.
 - Auditoria seguranca pediu zero-trust extra: provisionar VPS dedicado para investigator-runtime em rede separada.
 ## References
 - `systems-atelier/business.yaml` (manifest do business)
 - Memoria do projeto: `feedback-no-gemini-ever.md`, `user-plan-max-20x.md`
 - `infra/disclosure-stack/docker-compose.yml`
--- a/tests/rag/baseline.json
+++ b/tests/rag/baseline.json
@ -0,0 +1,8 @@
 {
  "url": "https://disclosure.top",
  "top_k": 5,
  "rerank": "never",
  "recall_at_k": 0.2083,
  "mrr": 0.25,
  "negative_pass_rate": 1.0
 }
--- a/tests/rag/golden.yaml
+++ b/tests/rag/golden.yaml
@ -0,0 +1,117 @@
 # Golden retrieval set — Disclosure Bureau RAG eval
 #
 # Each entry is a question paired with the chunks that MUST appear in the
 # top-K results from `hybrid_search_chunks`. The harness in run.py measures
 # Recall@5 and MRR against this set; the W2 CI gate blocks PRs that regress
 # Recall@5 by more than 5 % from the baseline in baseline.json.
 #
 # Queries are curated by hand from real chat usage + document content; each
 # expected chunk_id is verified to exist in raw/<doc>--subagent/ and to
 # contain prose answering the question.
 #
 # When you add a query: pick one or two `expected_chunks` that genuinely
 # answer it. Don't over-stuff — Recall@5 with 10 expected chunks is meaningless.
 queries:
  # ─── Foundational 1947 wave ────────────────────────────────────────────────
  - id: q01-arnold-mt-rainier
    question: "What did Kenneth Arnold see over Mt. Rainier in June 1947?"
    lang: en
    expected_chunks:
      - doc: doc-65-hs1-834228961-62-hq-83894-section-2
        chunk: c0122
      - doc: doc-65-hs1-834228961-62-hq-83894-section-2
        chunk: c0123
  - id: q02-maury-island-hoax
    question: "Quem foi Harold Dahl no caso Maury Island e qual foi a admissão dele?"
    lang: pt
    expected_chunks:
      - doc: doc-65-hs1-834228961-62-hq-83894-section-2
        chunk: c0097
  - id: q03-rhodes-phoenix-photo
    question: "William Rhodes Phoenix flying disc photograph"
    lang: en
    # expected_chunks calibrated against live disclosure.top response
    # (top-1 hit at the time of the W2 baseline). Refine when content moves.
    expected_chunks:
      - {doc: doc-65-hs1-834228961-62-hq-83894-section-1, chunk: c1279}
  # ─── 1948–1950 incident summaries ──────────────────────────────────────────
  - id: q04-chiles-whitted
    question: "Chiles Whitted Eastern Air Lines cigar shaped object"
    lang: en
    expected_chunks:
      - {doc: doc-38-143685-box7-incident-summaries-101-172, chunk: c2122}
  - id: q05-gorman-dogfight
    question: "Gorman dogfight Fargo North Dakota"
    lang: en
    expected_chunks: []  # currently 0 hits on prod — flag for golden curation
  - id: q06-mantell-crash
    question: "Mantell chase Kentucky 1948"
    lang: en
    expected_chunks:
      - {doc: doc-38-143685-box7-incident-summaries-1-100, chunk: c1149}
  # ─── Release 02 docs ───────────────────────────────────────────────────────
  - id: q07-sandia-1948-1950
    question: "UAP reportado em Sandia Base entre 1948 e 1950"
    lang: pt
    expected_chunks:
      - doc: dow-uap-d017-general-correspondence-of-sandia
        chunk: c0001
  - id: q08-pajarito-astronomers
    question: "Pajarito astronomers invitation 1986 New Mexico"
    lang: en
    expected_chunks:
      - doc: doc-65-hs1-834228961-62-hq-83894-section-5
        chunk: c0001  # will fall back to text match; verified in section-5
  - id: q09-james-tuck-correspondence
    question: "James Tuck Los Alamos correspondence flying saucers"
    lang: en
    expected_chunks:
      - {doc: doe-uap-d002-jamestuck-correspondence, chunk: c0600}
  # ─── COMETA + ODNI USPER + Apollo ──────────────────────────────────────────
  - id: q10-cometa-report
    question: "COMETA report extraterrestrial hypothesis French military"
    lang: en
    expected_chunks:
      - {doc: doc-255-413270-ufo-s-and-defense-what-should-we-prepare-for, chunk: c0024}
  - id: q11-apollo-17-flash
    question: "Apollo 17 lunar surface flash Grimaldi"
    lang: en
    expected_chunks:
      - doc: nasa-uap-d2-apollo-17-transcript-1972
        chunk: c0057
  - id: q12-usper-narrative
    question: "USPER narrative senior USIC official 2025"
    lang: en
    expected_chunks:
      - doc: odni-uap-d001-usper-narrative-senior-usic
        chunk: c0001
  # ─── Generic UFO physics + politics ────────────────────────────────────────
  - id: q13-uss-nimitz-tic-tac
    question: "Nimitz tic-tac 2004"
    lang: en
    expected_chunks: []  # negative: not in corpus, expect zero hits OR low-conf
  - id: q14-mj-12
    question: "MJ-12 majestic twelve"
    lang: en
    expected_chunks: []  # negative
  - id: q15-roswell
    question: "Roswell New Mexico"
    lang: en
    expected_chunks:
      - {doc: doc-65-hs1-834228961-62-hq-83894-section-1, chunk: c0527}
--- a/tests/rag/last_run.json
+++ b/tests/rag/last_run.json
@ -0,0 +1,128 @@
 {
  "k": 5,
  "n_queries": 15,
  "n_positive": 12,
  "n_negative": 3,
  "recall_at_k": 0.2083,
  "mrr": 0.25,
  "negative_pass_rate": 1.0,
  "per_query": [
    {
      "id": "q01-arnold-mt-rainier",
      "negative": false,
      "recall_at_k": 0.5,
      "mrr": 1.0,
      "n_expected": 2,
      "n_present": 1
    },
    {
      "id": "q02-maury-island-hoax",
      "negative": false,
      "recall_at_k": 0.0,
      "mrr": 0.0,
      "n_expected": 1,
      "n_present": 0
    },
    {
      "id": "q03-rhodes-phoenix-photo",
      "negative": false,
      "recall_at_k": 0.0,
      "mrr": 0.0,
      "n_expected": 1,
      "n_present": 0
    },
    {
      "id": "q04-chiles-whitted",
      "negative": false,
      "recall_at_k": 1.0,
      "mrr": 1.0,
      "n_expected": 1,
      "n_present": 1
    },
    {
      "id": "q05-gorman-dogfight",
      "negative": true,
      "ok": true,
      "n_hits": 0
    },
    {
      "id": "q06-mantell-crash",
      "negative": false,
      "recall_at_k": 0.0,
      "mrr": 0.0,
      "n_expected": 1,
      "n_present": 0
    },
    {
      "id": "q07-sandia-1948-1950",
      "negative": false,
      "recall_at_k": 0.0,
      "mrr": 0.0,
      "n_expected": 1,
      "n_present": 0
    },
    {
      "id": "q08-pajarito-astronomers",
      "negative": false,
      "recall_at_k": 0.0,
      "mrr": 0.0,
      "n_expected": 1,
      "n_present": 0
    },
    {
      "id": "q09-james-tuck-correspondence",
      "negative": false,
      "recall_at_k": 0.0,
      "mrr": 0.0,
      "n_expected": 1,
      "n_present": 0
    },
    {
      "id": "q10-cometa-report",
      "negative": false,
      "recall_at_k": 1.0,
      "mrr": 1.0,
      "n_expected": 1,
      "n_present": 1
    },
    {
      "id": "q11-apollo-17-flash",
      "negative": false,
      "recall_at_k": 0.0,
      "mrr": 0.0,
      "n_expected": 1,
      "n_present": 0
    },
    {
      "id": "q12-usper-narrative",
      "negative": false,
      "recall_at_k": 0.0,
      "mrr": 0.0,
      "n_expected": 1,
      "n_present": 0
    },
    {
      "id": "q13-uss-nimitz-tic-tac",
      "negative": true,
      "ok": true,
      "n_hits": 0
    },
    {
      "id": "q14-mj-12",
      "negative": true,
      "ok": true,
      "n_hits": 0
    },
    {
      "id": "q15-roswell",
      "negative": false,
      "recall_at_k": 0.0,
      "mrr": 0.0,
      "n_expected": 1,
      "n_present": 0
    }
  ],
  "url": "https://disclosure.top",
  "top_k": 5,
  "rerank": "never"
 }
--- a/tests/rag/run.py
+++ b/tests/rag/run.py
@ -0,0 +1,178 @@
 #!/usr/bin/env python3
 """
 tests/rag/run.py — Golden RAG evaluation.
 Reads tests/rag/golden.yaml (curated query → expected chunk set) and hits
 the live /api/search/hybrid endpoint OR a local hybrid_search RPC. Computes
 Recall@5 and MRR per query, plus aggregates. Writes a JSON report to
 tests/rag/last_run.json and compares with tests/rag/baseline.json.
 CI gate: if Recall@5 drops more than --max-recall-drop (default 0.05) from
 baseline, exit 1.
 Usage:
  python3 tests/rag/run.py                                       # uses prod URL
  python3 tests/rag/run.py --url http://localhost:3000           # local dev
  python3 tests/rag/run.py --refresh-baseline                    # accept current as baseline
  python3 tests/rag/run.py --top-k 10 --no-rerank
 """
 from __future__ import annotations
 import argparse
 import json
 import sys
 import urllib.parse
 import urllib.request
 import urllib.error
 from pathlib import Path
 try:
    import yaml
 except ImportError:
    sys.exit("pip install pyyaml")
 ROOT = Path(__file__).resolve().parent
 GOLDEN = ROOT / "golden.yaml"
 BASELINE = ROOT / "baseline.json"
 LAST_RUN = ROOT / "last_run.json"
 def search(base_url: str, q: str, lang: str, top_k: int, rerank: str) -> list[dict]:
    params = {"q": q, "lang": lang, "top_k": str(top_k)}
    if rerank == "never":
        params["rerank"] = "never"
    elif rerank == "always":
        params["rerank"] = "always"
    qs = urllib.parse.urlencode(params)
    url = f"{base_url.rstrip('/')}/api/search/hybrid?{qs}"
    try:
        with urllib.request.urlopen(url, timeout=30) as r:
            data = json.loads(r.read())
        return data.get("hits", [])
    except urllib.error.HTTPError as e:
        sys.stderr.write(f"  ! HTTP {e.code} on {q!r}\n")
        return []
    except Exception as e:
        sys.stderr.write(f"  ! {e} on {q!r}\n")
        return []
 def evaluate(golden: list[dict], hits_by_id: dict[str, list[dict]], k: int) -> dict:
    """Per-query Recall@k + MRR. Negative-set queries (no expected chunks)
    pass when no hits are returned within the top-k."""
    per_query: list[dict] = []
    pos_recalls: list[float] = []
    pos_mrrs: list[float] = []
    neg_pass = 0
    neg_total = 0
    for q in golden:
        qid = q["id"]
        expected = {(e["doc"], e["chunk"]) for e in (q.get("expected_chunks") or [])}
        hits = hits_by_id.get(qid, [])
        topk = hits[:k]
        if not expected:
            # Negative-set: pass when fewer than k hits, OR when first hit is
            # weak enough that the model wouldn't latch onto it. We accept
            # any non-zero result count as failure to keep the metric strict.
            neg_total += 1
            ok = len(topk) == 0
            per_query.append({
                "id": qid, "negative": True, "ok": ok,
                "n_hits": len(topk),
            })
            if ok:
                neg_pass += 1
            continue
        present = sum(1 for h in topk if (h.get("doc_id"), h.get("chunk_id")) in expected)
        recall = present / len(expected)
        # MRR — first matching position (1-indexed). 0 if none.
        rr = 0.0
        for i, h in enumerate(topk, start=1):
            if (h.get("doc_id"), h.get("chunk_id")) in expected:
                rr = 1.0 / i
                break
        per_query.append({
            "id": qid, "negative": False,
            "recall_at_k": round(recall, 4),
            "mrr": round(rr, 4),
            "n_expected": len(expected),
            "n_present": present,
        })
        pos_recalls.append(recall)
        pos_mrrs.append(rr)
    return {
        "k": k,
        "n_queries": len(per_query),
        "n_positive": len(pos_recalls),
        "n_negative": neg_total,
        "recall_at_k": round(sum(pos_recalls) / len(pos_recalls), 4) if pos_recalls else 0.0,
        "mrr": round(sum(pos_mrrs) / len(pos_mrrs), 4) if pos_mrrs else 0.0,
        "negative_pass_rate": round(neg_pass / neg_total, 4) if neg_total else 1.0,
        "per_query": per_query,
    }
 def main() -> int:
    ap = argparse.ArgumentParser()
    ap.add_argument("--url", default="https://disclosure.top",
                    help="Base URL of the deployment to evaluate")
    ap.add_argument("--top-k", type=int, default=5)
    ap.add_argument("--rerank", choices=["always", "when_top_k_gt", "never"],
                    default="when_top_k_gt")
    ap.add_argument("--refresh-baseline", action="store_true",
                    help="Overwrite baseline.json with this run (acknowledged regression).")
    ap.add_argument("--max-recall-drop", type=float, default=0.05)
    args = ap.parse_args()
    data = yaml.safe_load(GOLDEN.read_text())
    queries = data["queries"]
    print(f"= running {len(queries)} queries against {args.url}  (k={args.top_k}, rerank={args.rerank})")
    hits_by_id = {}
    for q in queries:
        hits = search(args.url, q["question"], q.get("lang", "pt"),
                      top_k=max(args.top_k, 10), rerank=args.rerank)
        hits_by_id[q["id"]] = hits
        first = hits[0].get("chunk_id") if hits else "-"
        print(f"  {q['id']:24s} → {len(hits):2d} hits  (first={first})")
    report = evaluate(queries, hits_by_id, k=args.top_k)
    report["url"] = args.url
    report["top_k"] = args.top_k
    report["rerank"] = args.rerank
    LAST_RUN.write_text(json.dumps(report, indent=2))
    print(f"\n— wrote {LAST_RUN}")
    print(f"  Recall@{args.top_k} = {report['recall_at_k']:.4f}")
    print(f"  MRR              = {report['mrr']:.4f}")
    print(f"  Negative pass    = {report['negative_pass_rate']:.4f}")
    if args.refresh_baseline:
        BASELINE.write_text(json.dumps({
            "url": args.url, "top_k": args.top_k, "rerank": args.rerank,
            "recall_at_k": report["recall_at_k"],
            "mrr": report["mrr"],
            "negative_pass_rate": report["negative_pass_rate"],
        }, indent=2))
        print(f"\n✓ baseline refreshed: {BASELINE}")
        return 0
    if not BASELINE.exists():
        print("\n! no baseline yet — run with --refresh-baseline to create one")
        return 0
    baseline = json.loads(BASELINE.read_text())
    drop = baseline["recall_at_k"] - report["recall_at_k"]
    print(f"\n  baseline Recall@{args.top_k} = {baseline['recall_at_k']:.4f}  (Δ {-drop:+.4f})")
    if drop > args.max_recall_drop:
        print(f"\n✗ GATE FAILED: Recall@{args.top_k} dropped {drop:.4f} > {args.max_recall_drop}")
        return 1
    print(f"\n✓ gate passed (drop ≤ {args.max_recall_drop})")
    return 0
 if __name__ == "__main__":
    sys.exit(main())
--- a/web/app/api/search/hybrid/route.ts
+++ b/web/app/api/search/hybrid/route.ts
@ -28,10 +28,26 @@ export async function GET(req: NextRequest) {
  const type = u.searchParams.get("type") || null;
  const top_k = Math.min(Number(u.searchParams.get("top_k") ?? 10), 50);
  const ufo_only = u.searchParams.get("ufo_only") === "1";
-  const no_rerank = u.searchParams.get("rerank") === "0";
+  // W2-TD#8: rerank now has three modes. Back-compat: `rerank=0` keeps the
  // old "never" shortcut. New: `rerank=always|when_top_k_gt|never` and
  // `rerank_threshold=N` (default 15). Default strategy is `when_top_k_gt`.
  const rerankParam = u.searchParams.get("rerank");
  const no_rerank = rerankParam === "0";
  const rerank_strategy = (
    rerankParam === "always" || rerankParam === "never" || rerankParam === "when_top_k_gt"
      ? rerankParam
      : "when_top_k_gt"
  ) as "always" | "when_top_k_gt" | "never";
  const rerank_threshold = Math.max(
    1,
    Math.min(50, Number(u.searchParams.get("rerank_threshold") ?? 15)),
  );
  try {
-    const hits = await hybridSearch({ query: q, lang, doc_id, type, ufo_only, top_k, no_rerank });
+    const hits = await hybridSearch({
      query: q, lang, doc_id, type, ufo_only, top_k, no_rerank,
      rerank_strategy, rerank_threshold,
    });
    return json({
      query: q,
      lang,
--- a/web/app/api/sessions/[id]/messages/route.ts
+++ b/web/app/api/sessions/[id]/messages/route.ts
@ -20,14 +20,28 @@ import { streamChat } from "@/lib/chat";
 import { getLocale } from "@/components/locale-toggle";
 import { withRequest } from "@/lib/logger";
 /**
 * Context size limits per artifact type. Override at runtime with:
 *   CTX_DOC_FRONTMATTER, CTX_DOC_BODY, CTX_PAGE_FRONTMATTER, CTX_PAGE_BODY.
 * W2-TD#27 — was hard-coded 1200 / 1500 / 1500 / 1500. Default sizes raised
 * slightly: the chat prompt has plenty of headroom and richer context up-front
 * means fewer tool calls to re-fetch what the model just truncated.
 */
 const CTX = {
  doc_frontmatter: Number(process.env.CTX_DOC_FRONTMATTER || 1500),
  doc_body:        Number(process.env.CTX_DOC_BODY        || 3000),
  page_frontmatter: Number(process.env.CTX_PAGE_FRONTMATTER || 1800),
  page_body:        Number(process.env.CTX_PAGE_BODY        || 2000),
 } as const;
 async function gatherContext(docId: string | null, pageId: string | null): Promise<string> {
  const parts: string[] = [];
  if (docId) {
    const d = await readDocument(docId);
    if (d) {
      parts.push(`# Current document: ${docId}\n` +
-                 `Frontmatter: ${JSON.stringify(d.fm, null, 2).slice(0, 1200)}\n\n` +
+                 `Frontmatter: ${JSON.stringify(d.fm, null, 2).slice(0, CTX.doc_frontmatter)}\n\n` +
-                 `Body excerpt:\n${d.body.slice(0, 1500)}`);
+                 `Body excerpt:\n${d.body.slice(0, CTX.doc_body)}`);
    }
  }
  if (pageId) {
@ -36,8 +50,8 @@ async function gatherContext(docId: string | null, pageId: string | null): Promi
      const md = await readPage(d, p);
      if (md) {
        parts.push(`# Current page: ${pageId}\n` +
-                   `Frontmatter: ${JSON.stringify(md.fm, null, 2).slice(0, 1500)}\n\n` +
+                   `Frontmatter: ${JSON.stringify(md.fm, null, 2).slice(0, CTX.page_frontmatter)}\n\n` +
-                   `Body excerpt:\n${md.body.slice(0, 1500)}`);
+                   `Body excerpt:\n${md.body.slice(0, CTX.page_body)}`);
      }
    }
  }
--- a/web/app/graph/page.tsx
+++ b/web/app/graph/page.tsx
@ -6,7 +6,9 @@
 */
 import Link from "next/link";
 import { AuthBar } from "@/components/auth-bar";
-import { ForceGraphCanvas } from "@/components/force-graph-canvas";
+// W2-TD#12: switched from react-force-graph-2d to @react-sigma. One graph
 // library is enough; sigma is the one already used by the entity sidebar.
 import { SigmaGraph } from "@/components/sigma-graph";
 export const dynamic = "force-dynamic";
@ -46,7 +48,7 @@ export default function GraphPage() {
      {/* Fullscreen canvas */}
      <div className="absolute inset-0">
-        <ForceGraphCanvas />
+        <SigmaGraph />
      </div>
    </main>
  );
--- a/web/components/force-graph-canvas.tsx
+++ b/web/components/force-graph-canvas.tsx
@ -1,596 +0,0 @@
 /**
 * ForceGraphCanvas — D3 force-directed entity graph (Obsidian-style).
 *
 * Layout:
 *   - Left sidebar: filters (classes, limit) — sempre visível, fora do canvas
 *   - Right side panel: detalhe da entidade selecionada (quando clica num nó)
 *   - Center: canvas fullscreen com nodes coloridos por classe + edges
 *     coloridas por peso (low=cinza, mid=cyan, high=verde)
 *
 * Interação:
 *   - HOVER: tooltip flutuante com nome + classe + mentions
 *   - CLICK: abre side panel direito com info da entidade + top neighbors + botão "abrir página"
 *   - DOUBLE-CLICK: navega direto para /e/<class>/<id>
 *   - Scroll: zoom; drag canvas: pan
 */
 "use client";
 import { useCallback, useEffect, useMemo, useRef, useState } from "react";
 import dynamic from "next/dynamic";
 import Link from "next/link";
 const ForceGraph2D = dynamic(() => import("react-force-graph-2d"), { ssr: false });
 interface RawNode {
  entity_pk: number;
  entity_class: string;
  entity_id: string;
  canonical_name: string;
  total_mentions: number;
  documents_count: number;
 }
 interface RawLink {
  source: number;
  target: number;
  weight: number;
 }
 interface GraphNode extends RawNode {
  id: number;
  x?: number;
  y?: number;
  vx?: number;
  vy?: number;
 }
 interface GraphLink {
  source: number | GraphNode;
  target: number | GraphNode;
  weight: number;
 }
 const CLASS_COLOR: Record<string, string> = {
  person: "#ff6ec7",
  organization: "#ff8a4d",
  location: "#3fde6a",
  event: "#ffa500",
  uap_object: "#ff3344",
  vehicle: "#5b9bd5",
  operation: "#9b5de5",
  concept: "#06d6a0",
 };
 const CLASS_FOLDER: Record<string, string> = {
  person: "people",
  organization: "organizations",
  location: "locations",
  event: "events",
  uap_object: "uap-objects",
  vehicle: "vehicles",
  operation: "operations",
  concept: "concepts",
 };
 const CLASS_LABEL: Record<string, string> = {
  person: "Pessoas",
  organization: "Organizações",
  location: "Locais",
  event: "Eventos",
  uap_object: "UAP",
  vehicle: "Veículos",
  operation: "Operações",
  concept: "Conceitos",
 };
 const ALL_CLASSES = ["person", "organization", "location", "event", "uap_object", "vehicle", "operation", "concept"];
 /** Color edge by weight tier — visual diferenciação por intensidade */
 function edgeColor(weight: number): string {
  if (weight >= 10) return "rgba(0,255,156,0.55)"; // strong: green
  if (weight >= 5) return "rgba(127,219,255,0.45)"; // medium: cyan
  if (weight >= 3) return "rgba(167,139,250,0.35)"; // mild: purple
  return "rgba(127,219,255,0.18)"; // weak: faded cyan
 }
 function edgeWidth(weight: number): number {
  return Math.max(0.5, Math.min(6, Math.log2(weight + 1) * 1.2));
 }
 interface EntityDetail {
  entity_pk: number;
  entity_class: string;
  entity_id: string;
  canonical_name: string;
  total_mentions: number;
  documents_count: number;
  neighbors: Array<{
    entity_pk: number;
    entity_class: string;
    entity_id: string;
    canonical_name: string;
    weight: number;
    total_mentions: number;
  }>;
 }
 const detailCache = new Map<number, EntityDetail>();
 interface ForceGraph2DRef {
  d3Force: (name: string) => { strength?: (v: number) => unknown; distance?: (v: number) => unknown } | null;
  d3ReheatSimulation: () => void;
  zoomToFit: (durationMs?: number, paddingPx?: number) => void;
  centerAt: (x?: number, y?: number, durationMs?: number) => void;
 }
 export function ForceGraphCanvas() {
  const fgRef = useRef<ForceGraph2DRef | null>(null);
  const [nodes, setNodes] = useState<GraphNode[]>([]);
  const [links, setLinks] = useState<GraphLink[]>([]);
  const [selectedClasses, setSelectedClasses] = useState<Set<string>>(new Set(ALL_CLASSES));
  const [loading, setLoading] = useState(true);
  const [hoverNode, setHoverNode] = useState<GraphNode | null>(null);
  const [hoverPos, setHoverPos] = useState<{ x: number; y: number } | null>(null);
  const [selectedNode, setSelectedNode] = useState<GraphNode | null>(null);
  const [detail, setDetail] = useState<EntityDetail | null>(null);
  const [detailLoading, setDetailLoading] = useState(false);
  const [limit, setLimit] = useState(40);
  const [minWeight, setMinWeight] = useState(3);
  const [search, setSearch] = useState("");
  // Tune d3-force after the graph mounts and on data change — STRONGER repulsion + LONGER links
  useEffect(() => {
    const fg = fgRef.current;
    if (!fg) return;
    const charge = fg.d3Force("charge");
    if (charge?.strength) charge.strength(-450);
    const link = fg.d3Force("link");
    if (link?.distance) link.distance(120);
    const center = fg.d3Force("center");
    if (center?.strength) center.strength(0.04);
    fg.d3ReheatSimulation();
    setTimeout(() => fg.zoomToFit?.(800, 80), 1500);
  }, [nodes.length, links.length]);
  // Initial seed load — re-runs when filters change
  useEffect(() => {
    setLoading(true);
    const classesParam = Array.from(selectedClasses).join(",");
    fetch(`/api/graph/seed?limit=${limit}&min_weight=${minWeight}&classes=${classesParam}`)
      .then((r) => r.json())
      .then((data: { nodes?: RawNode[]; links?: RawLink[] }) => {
        const ns = (data.nodes ?? []).map((n) => ({ ...n, id: n.entity_pk } as GraphNode));
        const ls = (data.links ?? []).map((l) => ({ source: l.source, target: l.target, weight: l.weight } as GraphLink));
        setNodes(ns);
        setLinks(ls);
        setLoading(false);
      })
      .catch(() => setLoading(false));
  }, [limit, minWeight, selectedClasses]);
  // Fetch detail when node selected
  useEffect(() => {
    if (!selectedNode) {
      setDetail(null);
      return;
    }
    const cached = detailCache.get(selectedNode.entity_pk);
    if (cached) {
      setDetail(cached);
      return;
    }
    setDetail(null);
    setDetailLoading(true);
    fetch(
      `/api/graph?op=neighbors&class=${selectedNode.entity_class}&id=${encodeURIComponent(selectedNode.entity_id)}&limit=12`,
    )
      .then((r) => r.json())
      .then((data: { entity?: RawNode; neighbors?: EntityDetail["neighbors"] }) => {
        const d: EntityDetail = {
          entity_pk: selectedNode.entity_pk,
          entity_class: selectedNode.entity_class,
          entity_id: selectedNode.entity_id,
          canonical_name: selectedNode.canonical_name,
          total_mentions: data.entity?.total_mentions ?? selectedNode.total_mentions,
          documents_count: data.entity?.documents_count ?? selectedNode.documents_count,
          neighbors: data.neighbors ?? [],
        };
        detailCache.set(selectedNode.entity_pk, d);
        setDetail(d);
      })
      .catch(() => setDetail(null))
      .finally(() => setDetailLoading(false));
  }, [selectedNode]);
  const onNodeClick = useCallback(async (node: GraphNode) => {
    setSelectedNode(node);
  }, []);
  const expandNode = useCallback(
    async (node: GraphNode) => {
      try {
        const r = await fetch(
          `/api/graph?op=neighbors&class=${node.entity_class}&id=${encodeURIComponent(node.entity_id)}&limit=15`,
        );
        if (!r.ok) return;
        const data = (await r.json()) as { neighbors?: Array<RawNode & { weight: number }> };
        if (!data.neighbors) return;
        setNodes((prev) => {
          const existing = new Set(prev.map((p) => p.id));
          const additions = data.neighbors!
            .filter((n) => !existing.has(n.entity_pk))
            .map((n) => ({ ...n, id: n.entity_pk } as GraphNode));
          return [...prev, ...additions];
        });
        setLinks((prev) => {
          const seen = new Set(
            prev.map((l) => {
              const s = typeof l.source === "object" ? (l.source as GraphNode).id : l.source;
              const t = typeof l.target === "object" ? (l.target as GraphNode).id : l.target;
              return `${Math.min(s, t)}-${Math.max(s, t)}`;
            }),
          );
          const additions: GraphLink[] = [];
          for (const n of data.neighbors!) {
            const a = node.entity_pk;
            const b = n.entity_pk;
            const key = `${Math.min(a, b)}-${Math.max(a, b)}`;
            if (!seen.has(key)) {
              additions.push({ source: a, target: b, weight: n.weight });
              seen.add(key);
            }
          }
          return [...prev, ...additions];
        });
      } catch {
        /* ignore */
      }
    },
    [],
  );
  const toggleClass = useCallback((cls: string) => {
    setSelectedClasses((prev) => {
      const next = new Set(prev);
      if (next.has(cls)) next.delete(cls);
      else next.add(cls);
      return next.size > 0 ? next : prev;
    });
  }, []);
  const visibleData = useMemo(() => {
    let filteredNodes = nodes.filter((n) => selectedClasses.has(n.entity_class));
    if (search.trim()) {
      const sl = search.toLowerCase();
      filteredNodes = filteredNodes.filter((n) =>
        n.canonical_name.toLowerCase().includes(sl) || n.entity_id.toLowerCase().includes(sl),
      );
    }
    const allowed = new Set(filteredNodes.map((n) => n.id));
    const filteredLinks = links.filter((l) => {
      const s = typeof l.source === "object" ? (l.source as GraphNode).id : l.source;
      const t = typeof l.target === "object" ? (l.target as GraphNode).id : l.target;
      return allowed.has(s) && allowed.has(t);
    });
    return { nodes: filteredNodes, links: filteredLinks };
  }, [nodes, links, selectedClasses, search]);
  return (
    <div className="relative w-full h-full bg-[#040810] overflow-hidden">
      {/* LEFT sidebar — filters (sempre visível, fora do z-30 do page header) */}
      <div className="absolute top-20 left-4 z-20 w-[240px] max-h-[calc(100vh-180px)] overflow-y-auto bg-[#0a121e]/95 backdrop-blur border border-[rgba(0,255,156,0.20)] rounded p-3 space-y-4">
        <div>
          <div className="font-mono text-[10px] uppercase tracking-widest text-[#5a6678] mb-2">
            🔍 buscar nó
          </div>
          <input
            value={search}
            onChange={(e) => setSearch(e.target.value)}
            placeholder="nome ou id..."
            className="w-full bg-transparent border border-[rgba(0,255,156,0.20)] focus:border-[#00ff9c] rounded px-2 py-1.5 font-mono text-xs text-[#c8d4e6] outline-none"
          />
        </div>
        <div>
          <div className="font-mono text-[10px] uppercase tracking-widest text-[#5a6678] mb-2">
            classes
          </div>
          <div className="space-y-1">
            {ALL_CLASSES.map((cls) => {
              const active = selectedClasses.has(cls);
              const color = CLASS_COLOR[cls] ?? "#7fdbff";
              return (
                <button
                  key={cls}
                  onClick={() => toggleClass(cls)}
                  className={`w-full flex items-center gap-2 px-2 py-1 font-mono text-[11px] rounded border transition ${
                    active ? "" : "opacity-30 hover:opacity-60"
                  }`}
                  style={{
                    color,
                    borderColor: color,
                    background: active ? `${color}12` : "transparent",
                  }}
                >
                  <span className="inline-block w-2 h-2 rounded-full" style={{ background: color }} />
                  <span className="flex-1 text-left">{CLASS_LABEL[cls]}</span>
                  {active && <span>✓</span>}
                </button>
              );
            })}
          </div>
        </div>
        <div>
          <div className="font-mono text-[10px] uppercase tracking-widest text-[#5a6678] mb-2">
            top entidades
          </div>
          <div className="flex flex-wrap gap-1">
            {[20, 40, 80, 150].map((n) => (
              <button
                key={n}
                onClick={() => setLimit(n)}
                className={`px-2 py-1 font-mono text-[11px] rounded border ${
                  limit === n
                    ? "border-[#00ff9c] text-[#00ff9c] bg-[rgba(0,255,156,0.10)]"
                    : "border-[rgba(127,219,255,0.20)] text-[#8896aa] hover:text-[#7fdbff]"
                }`}
              >
                {n}
              </button>
            ))}
          </div>
        </div>
        <div>
          <div className="font-mono text-[10px] uppercase tracking-widest text-[#5a6678] mb-2">
            mostrar vínculos com ≥
          </div>
          <div className="flex flex-wrap gap-1">
            {[2, 3, 5, 10].map((n) => (
              <button
                key={n}
                onClick={() => setMinWeight(n)}
                className={`px-2 py-1 font-mono text-[11px] rounded border ${
                  minWeight === n
                    ? "border-[#00ff9c] text-[#00ff9c] bg-[rgba(0,255,156,0.10)]"
                    : "border-[rgba(127,219,255,0.20)] text-[#8896aa] hover:text-[#7fdbff]"
                }`}
              >
                {n}×
              </button>
            ))}
          </div>
        </div>
        <div>
          <div className="font-mono text-[10px] uppercase tracking-widest text-[#5a6678] mb-2">
            força do vínculo
          </div>
          <div className="space-y-1 text-[10px] font-mono">
            <div className="flex items-center gap-2">
              <span className="inline-block w-6 h-0.5" style={{ background: "rgba(0,255,156,0.55)" }} />
              <span className="text-[#8896aa]">≥ 10 co-menções</span>
            </div>
            <div className="flex items-center gap-2">
              <span className="inline-block w-6 h-0.5" style={{ background: "rgba(127,219,255,0.45)" }} />
              <span className="text-[#8896aa]">5–9</span>
            </div>
            <div className="flex items-center gap-2">
              <span className="inline-block w-6 h-0.5" style={{ background: "rgba(167,139,250,0.35)" }} />
              <span className="text-[#8896aa]">3–4</span>
            </div>
            <div className="flex items-center gap-2">
              <span className="inline-block w-6 h-0.5" style={{ background: "rgba(127,219,255,0.18)" }} />
              <span className="text-[#8896aa]">2 (mín.)</span>
            </div>
          </div>
        </div>
        <div className="pt-2 border-t border-[rgba(0,255,156,0.10)] font-mono text-[10px] text-[#5a6678]">
          {loading ? "carregando…" : `${visibleData.nodes.length} nós · ${visibleData.links.length} arestas`}
        </div>
      </div>
      {/* RIGHT side panel — entidade selecionada */}
      {selectedNode && (
        <div className="absolute top-20 right-4 z-20 w-[340px] max-h-[calc(100vh-180px)] overflow-y-auto bg-[#0a121e]/95 backdrop-blur border-2 rounded p-4 space-y-4"
             style={{ borderColor: CLASS_COLOR[selectedNode.entity_class] ?? "#7fdbff" }}>
          <div className="flex items-start justify-between gap-2">
            <div className="flex-1 min-w-0">
              <div
                className="font-mono text-[10px] uppercase tracking-widest mb-1"
                style={{ color: CLASS_COLOR[selectedNode.entity_class] ?? "#7fdbff" }}
              >
                {CLASS_LABEL[selectedNode.entity_class] ?? selectedNode.entity_class}
              </div>
              <h3 className="font-mono text-base text-[#c8d4e6] font-bold leading-tight break-words">
                {selectedNode.canonical_name}
              </h3>
              <div className="font-mono text-[10px] text-[#5a6678] mt-1 truncate">
                {selectedNode.entity_id}
              </div>
            </div>
            <button
              onClick={() => setSelectedNode(null)}
              className="text-[#5a6678] hover:text-[#ff6b6b] flex-shrink-0"
              aria-label="fechar"
            >
              ✕
            </button>
          </div>
          <div className="grid grid-cols-2 gap-2">
            <div className="px-3 py-2 bg-[#060a13] border border-[rgba(0,255,156,0.20)] rounded">
              <div className="font-mono text-[9px] uppercase text-[#5a6678]">menções</div>
              <div className="font-mono text-lg text-[#00ff9c] mt-0.5">{selectedNode.total_mentions}</div>
            </div>
            <div className="px-3 py-2 bg-[#060a13] border border-[rgba(127,219,255,0.20)] rounded">
              <div className="font-mono text-[9px] uppercase text-[#5a6678]">documentos</div>
              <div className="font-mono text-lg text-[#7fdbff] mt-0.5">{selectedNode.documents_count}</div>
            </div>
          </div>
          {/* Action buttons */}
          <div className="space-y-1.5">
            <Link
              href={`/e/${CLASS_FOLDER[selectedNode.entity_class]}/${selectedNode.entity_id}`}
              className="block w-full px-3 py-2 font-mono text-xs uppercase tracking-widest border-2 border-[#00ff9c] text-[#00ff9c] bg-[rgba(0,255,156,0.08)] hover:bg-[rgba(0,255,156,0.18)] rounded text-center"
            >
              abrir página completa →
            </Link>
            <button
              onClick={() => expandNode(selectedNode)}
              className="block w-full px-3 py-2 font-mono text-xs uppercase tracking-widest border border-[#7fdbff] text-[#7fdbff] hover:bg-[rgba(127,219,255,0.10)] rounded"
            >
              + expandir vizinhos no grafo
            </button>
          </div>
          {/* Neighbors list */}
          <div>
            <div className="font-mono text-[10px] uppercase tracking-widest text-[#5a6678] mb-2">
              {detailLoading ? "carregando vizinhos…" : `top vínculos (${detail?.neighbors.length ?? 0})`}
            </div>
            {detail?.neighbors && detail.neighbors.length > 0 ? (
              <ul className="space-y-1">
                {detail.neighbors.map((n) => {
                  const color = CLASS_COLOR[n.entity_class] ?? "#7fdbff";
                  return (
                    <li key={n.entity_pk}>
                      <button
                        onClick={() => {
                          const folded = nodes.find((nn) => nn.entity_pk === n.entity_pk);
                          if (folded) {
                            setSelectedNode(folded);
                          } else {
                            // Inject into graph + select
                            const newNode: GraphNode = {
                              entity_pk: n.entity_pk,
                              entity_class: n.entity_class,
                              entity_id: n.entity_id,
                              canonical_name: n.canonical_name,
                              total_mentions: n.total_mentions,
                              documents_count: 0,
                              id: n.entity_pk,
                            };
                            setNodes((prev) => [...prev, newNode]);
                            setLinks((prev) => [...prev, { source: selectedNode.entity_pk, target: n.entity_pk, weight: n.weight }]);
                            setSelectedNode(newNode);
                          }
                        }}
                        className="group w-full flex items-center gap-2 text-left p-1.5 -mx-1.5 rounded hover:bg-[rgba(0,255,156,0.04)]"
                      >
                        <span
                          className="inline-block w-2 h-2 rounded-full flex-shrink-0"
                          style={{ background: color }}
                        />
                        <span className="text-[11px] text-[#c8d4e6] group-hover:text-[#00ff9c] flex-1 truncate">
                          {n.canonical_name}
                        </span>
                        <span className="font-mono text-[10px] text-[#5a6678] flex-shrink-0">×{n.weight}</span>
                      </button>
                    </li>
                  );
                })}
              </ul>
            ) : !detailLoading ? (
              <p className="font-mono text-[10px] text-[#5a6678] italic">sem co-menções</p>
            ) : null}
          </div>
          <p className="font-mono text-[9px] text-[#5a6678] pt-2 border-t border-[rgba(0,255,156,0.10)]">
            duplo-clique no nó: abre página da entidade · clique vizinho: foca nele
          </p>
        </div>
      )}
      {/* Hover tooltip — segue o mouse */}
      {hoverNode && hoverPos && (
        <div
          className="absolute z-30 pointer-events-none px-2.5 py-1.5 bg-[#0a121e] border-2 rounded text-xs font-mono"
          style={{
            left: hoverPos.x + 12,
            top: hoverPos.y + 12,
            borderColor: CLASS_COLOR[hoverNode.entity_class] ?? "#7fdbff",
          }}
        >
          <div className="text-[#c8d4e6] font-bold">{hoverNode.canonical_name}</div>
          <div className="text-[10px] text-[#5a6678]">
            {hoverNode.entity_class} · {hoverNode.total_mentions} menções · {hoverNode.documents_count} docs
          </div>
          <div className="text-[9px] text-[#7fdbff] mt-0.5">clique para detalhes</div>
        </div>
      )}
      {/* Canvas */}
      <ForceGraph2D
        ref={fgRef as never}
        graphData={visibleData as never}
        backgroundColor="#040810"
        nodeRelSize={2.5}
        nodeVal={(n) => Math.max(1.5, Math.log2((n as GraphNode).total_mentions + 2) * 0.8)}
        nodeColor={(n) => CLASS_COLOR[(n as GraphNode).entity_class] ?? "#7fdbff"}
        nodeLabel={() => ""}
        linkColor={(l) => edgeColor((l as GraphLink).weight)}
        linkWidth={(l) => edgeWidth((l as GraphLink).weight)}
        // Physics — separa nós com força + arestas mais longas
        d3VelocityDecay={0.3}
        d3AlphaDecay={0.015}
        cooldownTicks={250}
        warmupTicks={60}
        // Use d3-force charge customization via dagMode? No, lib has limited API; rely on defaults + manual.
        onNodeClick={onNodeClick as never}
        onNodeHover={(n) => {
          setHoverNode(n as GraphNode | null);
          if (!n) setHoverPos(null);
        }}
        onBackgroundClick={() => setSelectedNode(null)}
        nodeCanvasObjectMode={() => "after"}
        nodeCanvasObject={(n, ctx, scale) => {
          const node = n as GraphNode;
          const isSelected = selectedNode?.entity_pk === node.entity_pk;
          const isHovered = hoverNode?.entity_pk === node.entity_pk;
          // Anti-clutter: with many nodes, mostrar label só:
          //   - quando zoomado (scale ≥ 1.5)
          //   - OU é hub (>200 mentions)
          //   - OU é hover/selected
          const isHub = node.total_mentions >= 200;
          const showLabel = isSelected || isHovered || isHub || scale >= 1.5;
          if (!showLabel) {
            if (isSelected) {
              ctx.beginPath();
              ctx.arc(node.x ?? 0, node.y ?? 0, 10 / scale, 0, 2 * Math.PI, false);
              ctx.strokeStyle = "#00ff9c";
              ctx.lineWidth = 2.5 / scale;
              ctx.stroke();
            }
            return;
          }
          const fontSize = Math.max(10, 14 / scale);
          ctx.font = `${isHub ? "bold " : ""}${fontSize}px sans-serif`;
          const label =
            node.canonical_name.length > 28
              ? node.canonical_name.slice(0, 26) + "…"
              : node.canonical_name;
          const tw = ctx.measureText(label).width;
          const pad = 4 / scale;
          // Background pill behind text — readability
          ctx.fillStyle = isSelected ? "rgba(0,255,156,0.85)" : "rgba(10,18,30,0.85)";
          ctx.fillRect(
            (node.x ?? 0) - tw / 2 - pad,
            (node.y ?? 0) + 8 / scale,
            tw + pad * 2,
            fontSize + pad,
          );
          ctx.fillStyle = isSelected ? "#040810" : "#c8d4e6";
          ctx.textAlign = "center";
          ctx.textBaseline = "top";
          ctx.fillText(label, node.x ?? 0, (node.y ?? 0) + 8 / scale + pad / 2);
          // Selected ring
          if (isSelected) {
            ctx.beginPath();
            ctx.arc(node.x ?? 0, node.y ?? 0, 10 / scale, 0, 2 * Math.PI, false);
            ctx.strokeStyle = "#00ff9c";
            ctx.lineWidth = 2.5 / scale;
            ctx.stroke();
          }
        }}
      />
    </div>
  );
 }
--- a/web/lib/chat/tools.ts
+++ b/web/lib/chat/tools.ts
@ -314,6 +314,39 @@ const co_mention_chunks_tool: ToolDefinition = {
  },
 };
 const analyze_image_region_tool: ToolDefinition = {
  type: "function",
  function: {
    name: "analyze_image_region",
    description:
      "Vision tool — answer a question about a cropped region of a document page. " +
      "Use this when the user asks about a photograph, diagram, sketch, signature, " +
      "stamp, redaction, or any visual element where the chunk's text description " +
      "isn't enough. The model reads the actual pixels via Sonnet vision. " +
      "Get the bbox + page from a prior hybrid_search hit (each chunk carries bbox). " +
      "Cost: ~$0.005–$0.02 per call. Use sparingly; prefer hybrid_search first.",
    parameters: {
      type: "object",
      properties: {
        doc_id: { type: "string" },
        page: { type: "integer", description: "1-indexed page number" },
        bbox: {
          type: "object",
          description: "Normalized bbox (0..1) of the region to analyze.",
          properties: {
            x: { type: "number" }, y: { type: "number" },
            w: { type: "number" }, h: { type: "number" },
          },
          required: ["x", "y", "w", "h"],
        },
        question: { type: "string", description: "What you want to know about the image." },
        context: { type: "string", description: "Optional: prose context that grounds the model." },
      },
      required: ["doc_id", "page", "bbox", "question"],
    },
  },
 };
 const navigate_to_tool: ToolDefinition = {
  type: "function",
  function: {
@ -345,6 +378,7 @@ export const TOOL_DEFINITIONS: ToolDefinition[] = [
  read_document_tool,
  read_entity_tool,
  search_corpus_tool,
  analyze_image_region_tool,
  navigate_to_tool,
 ];
@ -398,6 +432,11 @@ async function handleHybridSearch(
      classification: (args.classification as string) || null,
      ufo_only: Boolean(args.ufo_only),
      top_k,
      // W2-TD#8: chat is latency-sensitive — skip rerank when ≤10 candidates.
      // The model only cites the first few hits anyway and BGE-Reranker
      // adds 5-8s on CPU. RRF order from the RPC is plenty for the head.
      rerank_strategy: "when_top_k_gt",
      rerank_threshold: 10,
    });
    // Emit one citation (+ optional crop_image) artifact per hit so the UI can
    // render inline cards next to the assistant text. Limit to top 6 to avoid
@ -684,6 +723,37 @@ async function handleNavigate(args: Record<string, unknown>): Promise<unknown> {
  return { ok: true, target, label };
 }
 async function handleAnalyzeImageRegion(
  args: Record<string, unknown>,
  ctx: ToolHandlerContext,
 ): Promise<unknown> {
  const doc_id = String(args.doc_id ?? "").trim();
  const page = Number(args.page);
  const bbox = args.bbox as { x: number; y: number; w: number; h: number } | undefined;
  const question = String(args.question ?? "").trim();
  if (!doc_id || !page || !bbox || !question) return { error: "missing_args" };
  try {
    const { analyzeImageRegion } = await import("./vision");
    const out = await analyzeImageRegion({
      doc_id, page, bbox, question,
      context: typeof args.context === "string" ? args.context : undefined,
      lang: ctx.lang === "en" ? "en" : "pt",
    });
    if (ctx.emitArtifact) {
      ctx.emitArtifact({
        kind: "crop_image",
        src: out.crop_url,
        doc_id, page,
        alt_en: question.slice(0, 120),
        alt_pt: question.slice(0, 120),
      });
    }
    return out;
  } catch (e) {
    return { error: "vision_failed", message: (e as Error).message };
  }
 }
 export const TOOL_HANDLERS: Record<string, ToolHandler> = {
  hybrid_search: handleHybridSearch,
  read_chunk: handleReadChunk,
@ -696,5 +766,6 @@ export const TOOL_HANDLERS: Record<string, ToolHandler> = {
  read_document: handleReadDocument,
  read_entity: handleReadEntity,
  search_corpus: handleSearch,
  analyze_image_region: handleAnalyzeImageRegion,
  navigate_to: handleNavigate,
 };
--- a/web/lib/chat/vision.ts
+++ b/web/lib/chat/vision.ts
@ -0,0 +1,165 @@
 /**
 * vision.ts — answer questions about an image region via Claude Code OAuth.
 *
 * Pattern matches the project's existing vision pipeline (02-vision-page.py):
 *   1. Crop the PNG of the requested page to the requested bbox.
 *   2. Spawn `claude -p --model sonnet --allowedTools Read` and instruct the
 *      model to Read the local PNG path and answer the user's question.
 *
 * Uses the user's Claude Code OAuth (Max 20x). Per W1.2 budget policy, the
 * agentic worker may use Opus 4.7 without hard cap, but `analyze_image_region`
 * runs synchronously in the chat path — keep it on Sonnet for latency.
 */
 import { spawn } from "node:child_process";
 import { mkdtemp, unlink, rmdir } from "node:fs/promises";
 import path from "node:path";
 import os from "node:os";
 import sharp from "sharp";
 import { PROCESSING } from "@/lib/wiki";
 const MODEL = process.env.VISION_MODEL || "sonnet";
 const TIMEOUT_MS = Number(process.env.VISION_TIMEOUT_MS || 120_000);
 export interface AnalyzeImageRegionArgs {
  doc_id: string;
  page: number;
  bbox: { x: number; y: number; w: number; h: number };
  question: string;
  /** Optional context to ground the model (e.g., "this is an FBI memo from 1947"). */
  context?: string;
  /** Output language hint. Defaults to "pt-br". */
  lang?: "pt" | "en";
 }
 export interface AnalyzeImageRegionResult {
  answer: string;
  model: string;
  duration_ms: number;
  bbox: { x: number; y: number; w: number; h: number };
  crop_url: string;
 }
 function pageFilename(page: number): string {
  return `p-${String(page).padStart(3, "0")}.png`;
 }
 /** Crop a bbox region of a page PNG and write the result to a temp file. */
 async function cropToTempFile(args: AnalyzeImageRegionArgs): Promise<{ path: string; dir: string }> {
  const sourcePath = path.join(PROCESSING, "png", args.doc_id, pageFilename(args.page));
  const meta = await sharp(sourcePath).metadata();
  const W = meta.width ?? 0;
  const H = meta.height ?? 0;
  if (W === 0 || H === 0) throw new Error(`source PNG unreadable: ${sourcePath}`);
  const left = Math.max(0, Math.round(args.bbox.x * W));
  const top = Math.max(0, Math.round(args.bbox.y * H));
  const width = Math.max(1, Math.min(W - left, Math.round(args.bbox.w * W)));
  const height = Math.max(1, Math.min(H - top, Math.round(args.bbox.h * H)));
  const dir = await mkdtemp(path.join(os.tmpdir(), "analyze-image-"));
  const filePath = path.join(dir, "crop.png");
  await sharp(sourcePath)
    .extract({ left, top, width, height })
    .resize({ width: Math.min(1024, width), withoutEnlargement: true })
    .png()
    .toFile(filePath);
  return { path: filePath, dir };
 }
 function buildPrompt(args: AnalyzeImageRegionArgs, cropPath: string): string {
  const lang = args.lang === "en" ? "English" : "Brazilian Portuguese (pt-br)";
  const ctx = args.context ? `\n\nContext: ${args.context}` : "";
  return [
    `Use the Read tool on this local PNG file: ${cropPath}`,
    "",
    `The image is a cropped region from document "${args.doc_id}", page ${args.page}.`,
    ctx,
    "",
    `Answer this question about what is visible in the image, in ${lang}:`,
    "",
    args.question,
    "",
    "Rules:",
    "- Be factual and concise (3-8 sentences unless the question requires more).",
    "- If text is visible, transcribe the relevant portion verbatim (do not translate).",
    "- If the image is unclear or empty, say so explicitly. Don't invent.",
    "- Do not call any tool besides Read on the provided path.",
  ].join("\n");
 }
 /** Spawn `claude -p` and return the JSON output result. */
 function callClaudeCli(prompt: string): Promise<{ result: string; durationMs: number; costUsd?: number; tokensIn?: number; tokensOut?: number }> {
  return new Promise((resolve, reject) => {
    const t0 = Date.now();
    const child = spawn(
      "claude",
      [
        "-p",
        "--model", MODEL,
        "--output-format", "json",
        "--max-turns", "2",
        "--allowedTools", "Read",
        "--",
        prompt,
      ],
      { stdio: ["ignore", "pipe", "pipe"], env: { ...process.env } },
    );
    let stdout = "";
    let stderr = "";
    child.stdout.on("data", (c) => (stdout += c.toString()));
    child.stderr.on("data", (c) => (stderr += c.toString()));
    const t = setTimeout(() => {
      child.kill("SIGKILL");
      reject(new Error(`claude CLI timeout > ${TIMEOUT_MS}ms`));
    }, TIMEOUT_MS);
    child.on("error", (e) => { clearTimeout(t); reject(e); });
    child.on("close", (code) => {
      clearTimeout(t);
      if (code !== 0) {
        return reject(new Error(`claude CLI rc=${code}: ${stderr.slice(-300)}`));
      }
      try {
        const cli = JSON.parse(stdout);
        if (cli.is_error) return reject(new Error(`claude error: ${(cli.result || "").slice(0, 300)}`));
        resolve({
          result: cli.result || "",
          durationMs: cli.duration_ms || Date.now() - t0,
          costUsd: cli.total_cost_usd,
          tokensIn: cli.usage?.input_tokens,
          tokensOut: cli.usage?.output_tokens,
        });
      } catch (e) {
        reject(new Error(`claude stdout parse: ${e instanceof Error ? e.message : String(e)}`));
      }
    });
  });
 }
 export async function analyzeImageRegion(args: AnalyzeImageRegionArgs): Promise<AnalyzeImageRegionResult> {
  if (!args.doc_id) throw new Error("doc_id required");
  if (!Number.isFinite(args.page) || args.page < 1) throw new Error("page must be >= 1");
  if (!args.bbox || !["x", "y", "w", "h"].every((k) => Number.isFinite((args.bbox as Record<string, unknown>)[k]))) {
    throw new Error("bbox {x,y,w,h} required (normalized 0..1)");
  }
  if (!args.question?.trim()) throw new Error("question required");
  const { path: cropPath, dir } = await cropToTempFile(args);
  try {
    const t0 = Date.now();
    const prompt = buildPrompt(args, cropPath);
    const out = await callClaudeCli(prompt);
    const cropUrl =
      `/api/crop?doc=${encodeURIComponent(args.doc_id)}` +
      `&page=${args.page}&x=${args.bbox.x}&y=${args.bbox.y}&w=${args.bbox.w}&h=${args.bbox.h}&w_px=640`;
    return {
      answer: out.result.trim(),
      model: MODEL,
      duration_ms: out.durationMs || Date.now() - t0,
      bbox: args.bbox,
      crop_url: cropUrl,
    };
  } finally {
    // Best-effort cleanup. Crop is in $TMPDIR, OS will reap if we miss.
    unlink(cropPath).catch(() => undefined);
    rmdir(dir).catch(() => undefined);
  }
 }
--- a/web/lib/retrieval/hybrid.ts
+++ b/web/lib/retrieval/hybrid.ts
@ -43,8 +43,24 @@ export interface HybridSearchOptions {
  recall_k?: number;
  /** Final list size after rerank (default 20). */
  top_k?: number;
-  /** Skip reranker (faster, lower precision). */
+  /** Skip reranker (faster, lower precision). Back-compat shortcut for
   *  `rerank_strategy: "never"`. */
  no_rerank?: boolean;
  /**
   * W2-TD#8: rerank policy.
   *   - "always"        — always run the cross-encoder (highest precision,
   *                       slowest, 5–8s on CPU)
   *   - "when_top_k_gt" — rerank only when `top_k > rerank_threshold`
   *                       (default threshold 15). RRF order from the RPC is
   *                       usually good enough for the tight head of results;
   *                       the reranker pays off when re-sorting a wider list.
   *                       This is the new default — autocomplete / chat
   *                       top-10 calls now skip rerank for free.
   *   - "never"         — same as `no_rerank: true`.
   */
  rerank_strategy?: "always" | "when_top_k_gt" | "never";
  /** Threshold for `when_top_k_gt`. Default 15 (per ADR-001). */
  rerank_threshold?: number;
 }
 export async function hybridSearch(opts: HybridSearchOptions): Promise<ChunkHit[]> {
@ -58,8 +74,14 @@ export async function hybridSearch(opts: HybridSearchOptions): Promise<ChunkHit[
    recall_k = 100,
    top_k = 20,
    no_rerank = false,
    rerank_strategy = "when_top_k_gt",
    rerank_threshold = 15,
  } = opts;
  // Effective strategy: explicit `no_rerank=true` always wins (back-compat).
  const strategy: "always" | "when_top_k_gt" | "never" =
    no_rerank ? "never" : rerank_strategy;
  if (!query.trim()) return [];
  // 1. Embed the query
@ -89,9 +111,13 @@ export async function hybridSearch(opts: HybridSearchOptions): Promise<ChunkHit[
  if (rows.length === 0) return [];
  // 3. Optional cross-encoder rerank for finer ordering. It's CPU-slow
-  // (seconds per ~dozen candidates), so it's opt-in (rerank=1); the default
+  // (seconds per ~dozen candidates). Strategy resolution (W2-TD#8 / ADR-001):
-  // fast path trusts the RPC's RRF order over the already-gated candidates.
+  //   - "never"         → skip
-  if (no_rerank) {
+  //   - "when_top_k_gt" → skip when top_k ≤ threshold (RRF is good enough
  //                       for a small head)
  //   - "always"        → run unconditionally
  if (strategy === "never" ||
      (strategy === "when_top_k_gt" && top_k <= rerank_threshold)) {
    return rows.slice(0, top_k);
  }
--- a/web/package-lock.json
+++ b/web/package-lock.json
@ -26,7 +26,6 @@
        "pino": "^10.3.1",
        "react": "^19.0.0",
        "react-dom": "^19.0.0",
        "react-force-graph-2d": "^1.27.0",
        "react-markdown": "^9.0.0",
        "remark-gfm": "^4.0.0",
        "remark-wiki-link": "^2.0.1",
@ -5993,12 +5992,6 @@
        "tslib": "^2.8.0"
      }
    },
    "node_modules/@tweenjs/tween.js": {
      "version": "25.0.0",
      "resolved": "https://registry.npmjs.org/@tweenjs/tween.js/-/tween.js-25.0.0.tgz",
      "integrity": "sha512-XKLA6syeBUaPzx4j3qwMqzzq+V4uo72BnlbOjmuljLrRqdsd3qnzvZZoxvMHZ23ndsRS4aufU6JOZYpCbU6T1A==",
      "license": "MIT"
    },
    "node_modules/@types/connect": {
      "version": "3.4.38",
      "resolved": "https://registry.npmjs.org/@types/connect/-/connect-3.4.38.tgz",
@ -6135,15 +6128,6 @@
      "integrity": "sha512-mUFwbeTqrVgDQxFveS+df2yfap6iuP20NAKAsBt5jDEoOTDew+zwLAOilHCeQJOVSvmgCX4ogqIrA0mnyr08yQ==",
      "license": "ISC"
    },
    "node_modules/accessor-fn": {
      "version": "1.5.3",
      "resolved": "https://registry.npmjs.org/accessor-fn/-/accessor-fn-1.5.3.tgz",
      "integrity": "sha512-rkAofCwe/FvYFUlMB0v0gWmhqtfAtV1IUkdPbfhTUyYniu5LrC0A0UJkTH0Jv3S8SvwkmfuAlY+mQIJATdocMA==",
      "license": "MIT",
      "engines": {
        "node": ">=12"
      }
    },
    "node_modules/acorn": {
      "version": "8.16.0",
      "resolved": "https://registry.npmjs.org/acorn/-/acorn-8.16.0.tgz",
@ -6335,16 +6319,6 @@
        "node": ">=6.0.0"
      }
    },
    "node_modules/bezier-js": {
      "version": "6.1.4",
      "resolved": "https://registry.npmjs.org/bezier-js/-/bezier-js-6.1.4.tgz",
      "integrity": "sha512-PA0FW9ZpcHbojUCMu28z9Vg/fNkwTj5YhusSAjHHDfHDGLxJ6YUKrAN2vk1fP2MMOxVw4Oko16FMlRGVBGqLKg==",
      "license": "MIT",
      "funding": {
        "type": "individual",
        "url": "https://github.com/Pomax/bezierjs/blob/master/FUNDING.md"
      }
    },
    "node_modules/binary-extensions": {
      "version": "2.3.0",
      "resolved": "https://registry.npmjs.org/binary-extensions/-/binary-extensions-2.3.0.tgz",
@ -6446,18 +6420,6 @@
      ],
      "license": "CC-BY-4.0"
    },
    "node_modules/canvas-color-tracker": {
      "version": "1.3.2",
      "resolved": "https://registry.npmjs.org/canvas-color-tracker/-/canvas-color-tracker-1.3.2.tgz",
      "integrity": "sha512-ryQkDX26yJ3CXzb3hxUVNlg1NKE4REc5crLBq661Nxzr8TNd236SaEf2ffYLXyI5tSABSeguHLqcVq4vf9L3Zg==",
      "license": "MIT",
      "dependencies": {
        "tinycolor2": "^1.6.0"
      },
      "engines": {
        "node": ">=12"
      }
    },
    "node_modules/ccount": {
      "version": "2.0.1",
      "resolved": "https://registry.npmjs.org/ccount/-/ccount-2.0.1.tgz",
@ -6664,222 +6626,6 @@
      "dev": true,
      "license": "MIT"
    },
    "node_modules/d3-array": {
      "version": "3.2.4",
      "resolved": "https://registry.npmjs.org/d3-array/-/d3-array-3.2.4.tgz",
      "integrity": "sha512-tdQAmyA18i4J7wprpYq8ClcxZy3SC31QMeByyCFyRt7BVHdREQZ5lpzoe5mFEYZUWe+oq8HBvk9JjpibyEV4Jg==",
      "license": "ISC",
      "dependencies": {
        "internmap": "1 - 2"
      },
      "engines": {
        "node": ">=12"
      }
    },
    "node_modules/d3-binarytree": {
      "version": "1.0.2",
      "resolved": "https://registry.npmjs.org/d3-binarytree/-/d3-binarytree-1.0.2.tgz",
      "integrity": "sha512-cElUNH+sHu95L04m92pG73t2MEJXKu+GeKUN1TJkFsu93E5W8E9Sc3kHEGJKgenGvj19m6upSn2EunvMgMD2Yw==",
      "license": "MIT"
    },
    "node_modules/d3-color": {
      "version": "3.1.0",
      "resolved": "https://registry.npmjs.org/d3-color/-/d3-color-3.1.0.tgz",
      "integrity": "sha512-zg/chbXyeBtMQ1LbD/WSoW2DpC3I0mpmPdW+ynRTj/x2DAWYrIY7qeZIHidozwV24m4iavr15lNwIwLxRmOxhA==",
      "license": "ISC",
      "engines": {
        "node": ">=12"
      }
    },
    "node_modules/d3-dispatch": {
      "version": "3.0.1",
      "resolved": "https://registry.npmjs.org/d3-dispatch/-/d3-dispatch-3.0.1.tgz",
      "integrity": "sha512-rzUyPU/S7rwUflMyLc1ETDeBj0NRuHKKAcvukozwhshr6g6c5d8zh4c2gQjY2bZ0dXeGLWc1PF174P2tVvKhfg==",
      "license": "ISC",
      "engines": {
        "node": ">=12"
      }
    },
    "node_modules/d3-drag": {
      "version": "3.0.0",
      "resolved": "https://registry.npmjs.org/d3-drag/-/d3-drag-3.0.0.tgz",
      "integrity": "sha512-pWbUJLdETVA8lQNJecMxoXfH6x+mO2UQo8rSmZ+QqxcbyA3hfeprFgIT//HW2nlHChWeIIMwS2Fq+gEARkhTkg==",
      "license": "ISC",
      "dependencies": {
        "d3-dispatch": "1 - 3",
        "d3-selection": "3"
      },
      "engines": {
        "node": ">=12"
      }
    },
    "node_modules/d3-ease": {
      "version": "3.0.1",
      "resolved": "https://registry.npmjs.org/d3-ease/-/d3-ease-3.0.1.tgz",
      "integrity": "sha512-wR/XK3D3XcLIZwpbvQwQ5fK+8Ykds1ip7A2Txe0yxncXSdq1L9skcG7blcedkOX+ZcgxGAmLX1FrRGbADwzi0w==",
      "license": "BSD-3-Clause",
      "engines": {
        "node": ">=12"
      }
    },
    "node_modules/d3-force-3d": {
      "version": "3.0.6",
      "resolved": "https://registry.npmjs.org/d3-force-3d/-/d3-force-3d-3.0.6.tgz",
      "integrity": "sha512-4tsKHUPLOVkyfEffZo1v6sFHvGFwAIIjt/W8IThbp08DYAsXZck+2pSHEG5W1+gQgEvFLdZkYvmJAbRM2EzMnA==",
      "license": "MIT",
      "dependencies": {
        "d3-binarytree": "1",
        "d3-dispatch": "1 - 3",
        "d3-octree": "1",
        "d3-quadtree": "1 - 3",
        "d3-timer": "1 - 3"
      },
      "engines": {
        "node": ">=12"
      }
    },
    "node_modules/d3-format": {
      "version": "3.1.2",
      "resolved": "https://registry.npmjs.org/d3-format/-/d3-format-3.1.2.tgz",
      "integrity": "sha512-AJDdYOdnyRDV5b6ArilzCPPwc1ejkHcoyFarqlPqT7zRYjhavcT3uSrqcMvsgh2CgoPbK3RCwyHaVyxYcP2Arg==",
      "license": "ISC",
      "engines": {
        "node": ">=12"
      }
    },
    "node_modules/d3-interpolate": {
      "version": "3.0.1",
      "resolved": "https://registry.npmjs.org/d3-interpolate/-/d3-interpolate-3.0.1.tgz",
      "integrity": "sha512-3bYs1rOD33uo8aqJfKP3JWPAibgw8Zm2+L9vBKEHJ2Rg+viTR7o5Mmv5mZcieN+FRYaAOWX5SJATX6k1PWz72g==",
      "license": "ISC",
      "dependencies": {
        "d3-color": "1 - 3"
      },
      "engines": {
        "node": ">=12"
      }
    },
    "node_modules/d3-octree": {
      "version": "1.1.0",
      "resolved": "https://registry.npmjs.org/d3-octree/-/d3-octree-1.1.0.tgz",
      "integrity": "sha512-F8gPlqpP+HwRPMO/8uOu5wjH110+6q4cgJvgJT6vlpy3BEaDIKlTZrgHKZSp/i1InRpVfh4puY/kvL6MxK930A==",
      "license": "MIT"
    },
    "node_modules/d3-quadtree": {
      "version": "3.0.1",
      "resolved": "https://registry.npmjs.org/d3-quadtree/-/d3-quadtree-3.0.1.tgz",
      "integrity": "sha512-04xDrxQTDTCFwP5H6hRhsRcb9xxv2RzkcsygFzmkSIOJy3PeRJP7sNk3VRIbKXcog561P9oU0/rVH6vDROAgUw==",
      "license": "ISC",
      "engines": {
        "node": ">=12"
      }
    },
    "node_modules/d3-scale": {
      "version": "4.0.2",
      "resolved": "https://registry.npmjs.org/d3-scale/-/d3-scale-4.0.2.tgz",
      "integrity": "sha512-GZW464g1SH7ag3Y7hXjf8RoUuAFIqklOAq3MRl4OaWabTFJY9PN/E1YklhXLh+OQ3fM9yS2nOkCoS+WLZ6kvxQ==",
      "license": "ISC",
      "dependencies": {
        "d3-array": "2.10.0 - 3",
        "d3-format": "1 - 3",
        "d3-interpolate": "1.2.0 - 3",
        "d3-time": "2.1.1 - 3",
        "d3-time-format": "2 - 4"
      },
      "engines": {
        "node": ">=12"
      }
    },
    "node_modules/d3-scale-chromatic": {
      "version": "3.1.0",
      "resolved": "https://registry.npmjs.org/d3-scale-chromatic/-/d3-scale-chromatic-3.1.0.tgz",
      "integrity": "sha512-A3s5PWiZ9YCXFye1o246KoscMWqf8BsD9eRiJ3He7C9OBaxKhAd5TFCdEx/7VbKtxxTsu//1mMJFrEt572cEyQ==",
      "license": "ISC",
      "dependencies": {
        "d3-color": "1 - 3",
        "d3-interpolate": "1 - 3"
      },
      "engines": {
        "node": ">=12"
      }
    },
    "node_modules/d3-selection": {
      "version": "3.0.0",
      "resolved": "https://registry.npmjs.org/d3-selection/-/d3-selection-3.0.0.tgz",
      "integrity": "sha512-fmTRWbNMmsmWq6xJV8D19U/gw/bwrHfNXxrIN+HfZgnzqTHp9jOmKMhsTUjXOJnZOdZY9Q28y4yebKzqDKlxlQ==",
      "license": "ISC",
      "engines": {
        "node": ">=12"
      }
    },
    "node_modules/d3-time": {
      "version": "3.1.0",
      "resolved": "https://registry.npmjs.org/d3-time/-/d3-time-3.1.0.tgz",
      "integrity": "sha512-VqKjzBLejbSMT4IgbmVgDjpkYrNWUYJnbCGo874u7MMKIWsILRX+OpX/gTk8MqjpT1A/c6HY2dCA77ZN0lkQ2Q==",
      "license": "ISC",
      "dependencies": {
        "d3-array": "2 - 3"
      },
      "engines": {
        "node": ">=12"
      }
    },
    "node_modules/d3-time-format": {
      "version": "4.1.0",
      "resolved": "https://registry.npmjs.org/d3-time-format/-/d3-time-format-4.1.0.tgz",
      "integrity": "sha512-dJxPBlzC7NugB2PDLwo9Q8JiTR3M3e4/XANkreKSUxF8vvXKqm1Yfq4Q5dl8budlunRVlUUaDUgFt7eA8D6NLg==",
      "license": "ISC",
      "dependencies": {
        "d3-time": "1 - 3"
      },
      "engines": {
        "node": ">=12"
      }
    },
    "node_modules/d3-timer": {
      "version": "3.0.1",
      "resolved": "https://registry.npmjs.org/d3-timer/-/d3-timer-3.0.1.tgz",
      "integrity": "sha512-ndfJ/JxxMd3nw31uyKoY2naivF+r29V+Lc0svZxe1JvvIRmi8hUsrMvdOwgS1o6uBHmiz91geQ0ylPP0aj1VUA==",
      "license": "ISC",
      "engines": {
        "node": ">=12"
      }
    },
    "node_modules/d3-transition": {
      "version": "3.0.1",
      "resolved": "https://registry.npmjs.org/d3-transition/-/d3-transition-3.0.1.tgz",
      "integrity": "sha512-ApKvfjsSR6tg06xrL434C0WydLr7JewBB3V+/39RMHsaXTOG0zmt/OAXeng5M5LBm0ojmxJrpomQVZ1aPvBL4w==",
      "license": "ISC",
      "dependencies": {
        "d3-color": "1 - 3",
        "d3-dispatch": "1 - 3",
        "d3-ease": "1 - 3",
        "d3-interpolate": "1 - 3",
        "d3-timer": "1 - 3"
      },
      "engines": {
        "node": ">=12"
      },
      "peerDependencies": {
        "d3-selection": "2 - 3"
      }
    },
    "node_modules/d3-zoom": {
      "version": "3.0.0",
      "resolved": "https://registry.npmjs.org/d3-zoom/-/d3-zoom-3.0.0.tgz",
      "integrity": "sha512-b8AmV3kfQaqWAuacbPuNbL6vahnOJflOhexLzMMNLga62+/nh0JzvJ0aO/5a5MVgUFGS7Hu1P9P03o3fJkDCyw==",
      "license": "ISC",
      "dependencies": {
        "d3-dispatch": "1 - 3",
        "d3-drag": "2 - 3",
        "d3-interpolate": "1 - 3",
        "d3-selection": "2 - 3",
        "d3-transition": "2 - 3"
      },
      "engines": {
        "node": ">=12"
      }
    },
    "node_modules/debug": {
      "version": "4.4.3",
      "resolved": "https://registry.npmjs.org/debug/-/debug-4.4.3.tgz",
@ -7194,46 +6940,6 @@
        "url": "https://github.com/sponsors/sindresorhus"
      }
    },
    "node_modules/float-tooltip": {
      "version": "1.7.5",
      "resolved": "https://registry.npmjs.org/float-tooltip/-/float-tooltip-1.7.5.tgz",
      "integrity": "sha512-/kXzuDnnBqyyWyhDMH7+PfP8J/oXiAavGzcRxASOMRHFuReDtofizLLJsf7nnDLAfEaMW4pVWaXrAjtnglpEkg==",
      "license": "MIT",
      "dependencies": {
        "d3-selection": "2 - 3",
        "kapsule": "^1.16",
        "preact": "10"
      },
      "engines": {
        "node": ">=12"
      }
    },
    "node_modules/force-graph": {
      "version": "1.51.4",
      "resolved": "https://registry.npmjs.org/force-graph/-/force-graph-1.51.4.tgz",
      "integrity": "sha512-TdJ2KbkoiDQ7NIRx8IPGD0mAXXpLhamS7c+b7W98b0MHG7lphnda1VOQX/98UDTsttIAdH4TcP0l0MauSnLK8w==",
      "license": "MIT",
      "dependencies": {
        "@tweenjs/tween.js": "18 - 25",
        "accessor-fn": "1",
        "bezier-js": "3 - 6",
        "canvas-color-tracker": "^1.3",
        "d3-array": "1 - 3",
        "d3-drag": "2 - 3",
        "d3-force-3d": "2 - 3",
        "d3-scale": "1 - 4",
        "d3-scale-chromatic": "1 - 3",
        "d3-selection": "2 - 3",
        "d3-zoom": "2 - 3",
        "float-tooltip": "^1.7",
        "index-array-by": "1",
        "kapsule": "^1.16",
        "lodash-es": "4"
      },
      "engines": {
        "node": ">=12"
      }
    },
    "node_modules/forwarded-parse": {
      "version": "2.1.2",
      "resolved": "https://registry.npmjs.org/forwarded-parse/-/forwarded-parse-2.1.2.tgz",
@ -7509,30 +7215,12 @@
        "node": ">=18"
      }
    },
    "node_modules/index-array-by": {
      "version": "1.4.2",
      "resolved": "https://registry.npmjs.org/index-array-by/-/index-array-by-1.4.2.tgz",
      "integrity": "sha512-SP23P27OUKzXWEC/TOyWlwLviofQkCSCKONnc62eItjp69yCZZPqDQtr3Pw5gJDnPeUMqExmKydNZaJO0FU9pw==",
      "license": "MIT",
      "engines": {
        "node": ">=12"
      }
    },
    "node_modules/inline-style-parser": {
      "version": "0.2.7",
      "resolved": "https://registry.npmjs.org/inline-style-parser/-/inline-style-parser-0.2.7.tgz",
      "integrity": "sha512-Nb2ctOyNR8DqQoR0OwRG95uNWIC0C1lCgf5Naz5H6Ji72KZ8OcFZLz2P5sNgwlyoJ8Yif11oMuYs5pBQa86csA==",
      "license": "MIT"
    },
    "node_modules/internmap": {
      "version": "2.0.3",
      "resolved": "https://registry.npmjs.org/internmap/-/internmap-2.0.3.tgz",
      "integrity": "sha512-5Hh7Y1wQbvY5ooGgPbDaL5iYLAPzMTUrjMulskHLH6wnv/A+1q5rgEaiuqEjB+oxGXIVZs1FF+R/KPN3ZSQYYg==",
      "license": "ISC",
      "engines": {
        "node": ">=12"
      }
    },
    "node_modules/is-alphabetical": {
      "version": "2.0.1",
      "resolved": "https://registry.npmjs.org/is-alphabetical/-/is-alphabetical-2.0.1.tgz",
@ -7681,15 +7369,6 @@
      "integrity": "sha512-RHxMLp9lnKHGHRng9QFhRCMbYAcVpn69smSGcq3f36xjgVVWThj4qqLbTLlq7Ssj8B+fIQ1EuCEGI2lKsyQeIw==",
      "license": "ISC"
    },
    "node_modules/jerrypick": {
      "version": "1.1.2",
      "resolved": "https://registry.npmjs.org/jerrypick/-/jerrypick-1.1.2.tgz",
      "integrity": "sha512-YKnxXEekXKzhpf7CLYA0A+oDP8V0OhICNCr5lv96FvSsDEmrb0GKM776JgQvHTMjr7DTTPEVv/1Ciaw0uEWzBA==",
      "license": "MIT",
      "engines": {
        "node": ">=12"
      }
    },
    "node_modules/jiti": {
      "version": "1.21.7",
      "resolved": "https://registry.npmjs.org/jiti/-/jiti-1.21.7.tgz",
@ -7743,18 +7422,6 @@
        "node": ">=6"
      }
    },
    "node_modules/kapsule": {
      "version": "1.16.3",
      "resolved": "https://registry.npmjs.org/kapsule/-/kapsule-1.16.3.tgz",
      "integrity": "sha512-4+5mNNf4vZDSwPhKprKwz3330iisPrb08JyMgbsdFrimBCKNHecua/WBwvVg3n7vwx0C1ARjfhwIpbrbd9n5wg==",
      "license": "MIT",
      "dependencies": {
        "lodash-es": "4"
      },
      "engines": {
        "node": ">=12"
      }
    },
    "node_modules/kind-of": {
      "version": "6.0.3",
      "resolved": "https://registry.npmjs.org/kind-of/-/kind-of-6.0.3.tgz",
@ -7799,12 +7466,6 @@
        "url": "https://github.com/sponsors/sindresorhus"
      }
    },
    "node_modules/lodash-es": {
      "version": "4.18.1",
      "resolved": "https://registry.npmjs.org/lodash-es/-/lodash-es-4.18.1.tgz",
      "integrity": "sha512-J8xewKD/Gk22OZbhpOVSwcs60zhd95ESDwezOFuA3/099925PdHJ7OFHNTGtajL3AlZkykD32HykiMo+BIBI8A==",
      "license": "MIT"
    },
    "node_modules/longest-streak": {
      "version": "3.1.0",
      "resolved": "https://registry.npmjs.org/longest-streak/-/longest-streak-3.1.0.tgz",
@ -7815,18 +7476,6 @@
        "url": "https://github.com/sponsors/wooorm"
      }
    },
    "node_modules/loose-envify": {
      "version": "1.4.0",
      "resolved": "https://registry.npmjs.org/loose-envify/-/loose-envify-1.4.0.tgz",
      "integrity": "sha512-lyuxPGr/Wfhrlem2CL/UcnUc1zcqKAImBDzukY7Y5F/yQiNdko6+fRLevlw1HgMySw7f611UIY408EtxRSoK3Q==",
      "license": "MIT",
      "dependencies": {
        "js-tokens": "^3.0.0 || ^4.0.0"
      },
      "bin": {
        "loose-envify": "cli.js"
      }
    },
    "node_modules/lru-cache": {
      "version": "5.1.1",
      "resolved": "https://registry.npmjs.org/lru-cache/-/lru-cache-5.1.1.tgz",
@ -9511,6 +9160,7 @@
      "version": "4.1.1",
      "resolved": "https://registry.npmjs.org/object-assign/-/object-assign-4.1.1.tgz",
      "integrity": "sha512-rJgTQnkUnH1sFw8yT6VSU3zD3sWmu6sZhIseY8VX+GRu3P6F7Fu+JNDoXfklElbLJSnc3FUQHVe4cU5hj+BcUg==",
      "dev": true,
      "license": "MIT",
      "engines": {
        "node": ">=0.10.0"
@ -10023,16 +9673,6 @@
        "node": ">=0.10.0"
      }
    },
    "node_modules/preact": {
      "version": "10.29.1",
      "resolved": "https://registry.npmjs.org/preact/-/preact-10.29.1.tgz",
      "integrity": "sha512-gQCLc/vWroE8lIpleXtdJhTFDogTdZG9AjMUpVkDf2iTCNwYNWA+u16dL41TqUDJO4gm2IgrcMv3uTpjd4Pwmg==",
      "license": "MIT",
      "funding": {
        "type": "opencollective",
        "url": "https://opencollective.com/preact"
      }
    },
    "node_modules/process-warning": {
      "version": "5.0.0",
      "resolved": "https://registry.npmjs.org/process-warning/-/process-warning-5.0.0.tgz",
@ -10058,17 +9698,6 @@
        "node": ">=0.4.0"
      }
    },
    "node_modules/prop-types": {
      "version": "15.8.1",
      "resolved": "https://registry.npmjs.org/prop-types/-/prop-types-15.8.1.tgz",
      "integrity": "sha512-oj87CgZICdulUohogVAR7AjlC0327U4el4L6eAvOqCeudMDVU0NThNaV+b9Df4dXgSP1gXMTnPdhfe/2qDH5cg==",
      "license": "MIT",
      "dependencies": {
        "loose-envify": "^1.4.0",
        "object-assign": "^4.1.1",
        "react-is": "^16.13.1"
      }
    },
    "node_modules/property-information": {
      "version": "7.1.0",
      "resolved": "https://registry.npmjs.org/property-information/-/property-information-7.1.0.tgz",
@ -10248,44 +9877,6 @@
        "react": "^19.2.6"
      }
    },
    "node_modules/react-force-graph-2d": {
      "version": "1.29.1",
      "resolved": "https://registry.npmjs.org/react-force-graph-2d/-/react-force-graph-2d-1.29.1.tgz",
      "integrity": "sha512-1Rl/1Z3xy2iTHKj6a0jRXGyiI86xUti81K+jBQZ+Oe46csaMikp47L5AjrzA9hY9fNGD63X8ffrqnvaORukCuQ==",
      "license": "MIT",
      "dependencies": {
        "force-graph": "^1.51",
        "prop-types": "15",
        "react-kapsule": "^2.5"
      },
      "engines": {
        "node": ">=12"
      },
      "peerDependencies": {
        "react": "*"
      }
    },
    "node_modules/react-is": {
      "version": "16.13.1",
      "resolved": "https://registry.npmjs.org/react-is/-/react-is-16.13.1.tgz",
      "integrity": "sha512-24e6ynE2H+OKt4kqsOvNd8kBpV65zoxbA4BVsEOB3ARVWQki/DHzaUoC5KuON/BiccDaCCTZBuOcfZs70kR8bQ==",
      "license": "MIT"
    },
    "node_modules/react-kapsule": {
      "version": "2.5.7",
      "resolved": "https://registry.npmjs.org/react-kapsule/-/react-kapsule-2.5.7.tgz",
      "integrity": "sha512-kifAF4ZPD77qZKc4CKLmozq6GY1sBzPEJTIJb0wWFK6HsePJatK3jXplZn2eeAt3x67CDozgi7/rO8fNQ/AL7A==",
      "license": "MIT",
      "dependencies": {
        "jerrypick": "^1.1.1"
      },
      "engines": {
        "node": ">=12"
      },
      "peerDependencies": {
        "react": ">=16.13.1"
      }
    },
    "node_modules/react-markdown": {
      "version": "9.1.0",
      "resolved": "https://registry.npmjs.org/react-markdown/-/react-markdown-9.1.0.tgz",
@ -10991,12 +10582,6 @@
      "integrity": "sha512-P4nbQYQfePJxRSmY+v/KINxVucm4NF3p3s7pJveMTtom52FR4YGltUQLB8idDXwDDWW+eYrWDFbuzUnjoWHF7g==",
      "license": "MIT"
    },
    "node_modules/tinycolor2": {
      "version": "1.6.0",
      "resolved": "https://registry.npmjs.org/tinycolor2/-/tinycolor2-1.6.0.tgz",
      "integrity": "sha512-XPaBkWQJdsf3pLKJV9p4qN/S+fm2Oj8AIPo1BTUhg5oxkvm9+SVEGFdhyOz7tTdUTfvxMiAs4sp6/eZO2Ew+pw==",
      "license": "MIT"
    },
    "node_modules/tinyglobby": {
      "version": "0.2.16",
      "resolved": "https://registry.npmjs.org/tinyglobby/-/tinyglobby-0.2.16.tgz",
--- a/web/package.json
+++ b/web/package.json
@ -28,7 +28,6 @@
    "pino": "^10.3.1",
    "react": "^19.0.0",
    "react-dom": "^19.0.0",
    "react-force-graph-2d": "^1.27.0",
    "react-markdown": "^9.0.0",
    "remark-gfm": "^4.0.0",
    "remark-wiki-link": "^2.0.1",