golangLAKEHOUSE

Author	SHA1	Message	Date
root	ee2a40c505	audit-FULL: port phases 1/2/5/7; only acceptance.ts (TS-only) remains skipped
Closes 4 of the 5 phases the initial audit-FULL port left as deferred. The pattern: most "deferred" phases didn't actually need the un-ported Rust pieces; they were observer-mode by design and just needed to read existing on-disk artifacts.
Phase 1 (schema validators) → ported via exec.Command: Invokes `go test ./internal/distillation/...`, the Go equivalent of Rust's `bun test auditor/schemas/distillation/`. New GoTestModule field on AuditFullOptions controls the package pattern; empty disables the invocation (test mode, prevents recursion when audit-full is invoked from inside `go test`).
Phase 2 (evidence materialization) → ported as observer: Reads data/evidence/ directly and tallies rows + tier-1 source hits. Doesn't re-run the materializer (which is Rust-side TS). Emits p2_evidence_rows + p2_evidence_skips metrics matching Rust shape, so drop-in audit_baselines.jsonl entries are possible.
Phase 5 (run summary) → ported as observer: Reads reports/distillation/{run_id}/summary.json + 5 stage receipts. Validates schema_version=1, run_hash sha256, git_commit 40-char hex, all stage receipts decode as JSON. Full schema validation (StageReceipt schema) is intentionally NOT ported: it would require porting the TS schemas/distillation/ validators in full; basic shape checks catch the load-bearing invariants.
Phase 7 (replay log) → ported as observer: Reads data/_kb/replay_runs.jsonl, validates last 50 rows parse as JSON. Skips the live-replay invocation that Rust's phase 7 also does; porting Rust replay.ts is substantial and not in scope. The "log shape sanity" check is what audit-full actually needs; the live invocation is a separate concern.
Phase 6 (acceptance gate) → STILL SKIPPED: Rust acceptance.ts is a TS-only fixture harness with bun-specific deps. Porting the fixtures (tests/fixtures/distillation/acceptance/) + the 22-invariant runner to Go is an ADR-worthy undertaking. Documented in the header comment.
Live-data probe (against /home/profit/lakehouse): Skips count: 4 → 1 (only phase 6). Required checks: 6/6 → 12/12 PASS. New metric: p2_evidence_rows=1055, BYTE-EQUAL to the Rust pipeline's collect.records_out from the latest summary.json. Cross-runtime parity now extends across phases 0/1/2/3/4/5/7.
6 new tests:
- TestPhase2_EvidenceTallyFromOnDisk: row + tier-1-hit tallying
- TestPhase5_FullSummaryFlow: complete run-summary fixture passes
- TestPhase5_ShortRunHashCaught: bad run_hash fails required check
- TestPhase7_ReplayLogReadsFromDisk: row-count reporting
- TestPhase7_MalformedTailRowsCaught: structural parse failure
- TestRunAuditFull_FullFixtureFlow updated to seed evidence/ + reports/distillation/ for the phases now wired.
Cleanup: removed local sortStrings helper (replaced with sort.Strings now that `sort` is imported for phase 5's mtime-sort).
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-01 02:35:13 -05:00
root	55b8c76a8c	distillation: audit-FULL pipeline port (phases 0/3/4); cross-runtime metric parity verified
Ports the metric-collection passes from scripts/distillation/audit_full.ts. The substrate that PRODUCES audit_baselines.jsonl entries, the half OPEN #2 left as "deferred to next wave" after the read/write substrate landed in ca142b9.
Phase coverage:
  Phase 0 (file presence)             ported
  Phase 1 (schema validators)         skipped (Go's `go test` covers it)
  Phase 2 (materializer dry-run)      deferred (Go materializer not yet ported)
  Phase 3 (scored-runs distribution)  ported
  Phase 4 (contamination firewall)    ported
  Phase 5 (receipts validation)       deferred (Go run-summary JSON not yet emitted)
  Phase 6 (replay sanity)             deferred (Go replay tool not ported)
  Phase 7 (run summary lineage)       deferred (same)
Cross-runtime parity verified end-to-end: Go-side audit-full against /home/profit/lakehouse produced metrics IDENTICAL to the last Rust-emitted audit_baselines.jsonl entry. All 8 ported metrics match byte-for-byte:
  p3_accepted=386, p3_partial=132, p3_rejected=57, p3_human=480,
  p4_sft_rows=353, p4_rag_rows=448, p4_pref_pairs=83, p4_total_quarantined=1325
6/6 required checks pass on live data.
Components:
- internal/distillation/audit_full.go: PhaseCheck struct (mirrors Rust shape), PhaseCheckReport aggregation, RunAuditFull orchestrator, auditPhase0/3/4 implementations, FormatAuditFullReport Markdown writer.
- cmd/audit_full/main.go: CLI binary with -root, -out, -json, -append-baseline flags. Operators run "./bin/audit_full -append-baseline" to grow the longitudinal log alongside the Rust pipeline (entries are interchangeable, same envelope shape).
- 6 new tests: empty-root failure handling, full-fixture clean PASS (locks all 8 metrics + all 6 required checks), SFT firewall contamination detection, preference self-pair detection, sig_hash regex correctness (rejects wrong-length + uppercase), Markdown formatter smoke.
Live-data probe captured at reports/cutover/audit_full_go_vs_rust.md (linked from reports/cutover/SUMMARY.md). Same shape as the audit_baselines round-trip evidence: both Go-side ports of the distillation surface are now validated against real Rust data, not just fixtures.
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-01 01:30:23 -05:00
root	0d4f033b34	audit_baselines: round-trip validation against live Rust data
Same shape of proof as embed_parity.sh for the embed endpoint: take the just-shipped Go port (ca142b9) and validate it against the actual production data the Rust legacy emits, not just unit-test fixtures. Locks the cross-runtime parity that operators running mixed pipelines depend on.
scripts/cutover/audit_baselines_validate.go:
- Reads /home/profit/lakehouse/data/_kb/audit_baselines.jsonl
- Parses every entry via the Go AuditBaseline struct
- Round-trips the last entry: encode → decode → field-by-field equality check (catches any silently-dropped JSON keys)
- Calls LoadLastBaseline against the live file (proves the public API works on real shapes, not just inline parsing)
- Computes BuildAuditDriftTable(first → last): full-window lineage drift over the captured baselines
Live-data probe results (reports/cutover/audit_baselines_roundtrip.md):
- 7 entries parse without error
- Round-trip is byte-equal on every metric + every header field
- Drift table fires the expected verdicts:
  - p2_evidence_rows 12→82 (+583%) → warn (above 20% threshold)
  - p3_accepted/partial/rejected/human 0→non-zero → warn (the zero-baseline edge case TestBuildAuditDriftTable_ZeroBaseline was designed to lock; verified now firing on real history)
  - p4_* metrics +0% → ok (stable across the window)
What this does NOT prove (documented in the report): the Go-side audit-FULL pipeline that PRODUCES baselines doesn't exist yet. Only the load/append/drift substrate is ported. Operators running audit-full from Go would still need a metric-collection pass; that's a separate port deliberately not in this wave.
reports/cutover/SUMMARY.md gains a new row alongside the embed parity entries; cutover-prep verification log keeps the discipline of "verified against real data, not just fixtures."
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-01 00:20:18 -05:00
root	5687ec65c2	G5 cutover prep: embed parity probe (Rust /ai/embed ↔ Go /v1/embed verified)
First concrete cutover artifact: scripts/cutover/embed_parity.sh brings up Go embedd + gateway alongside the live Rust gateway, hits both /ai/embed and /v1/embed with the same forced model, and emits a per-date verdict report under reports/cutover/.
Why embed first: the parity invariant is one math identity (cosine sim of vectors against same input). Retrieve has thousands of edge cases. If embed parity holds, all downstream vector consumers inherit confidence; if it doesn't, we catch it in 30s instead of after a flip.
Verdict 2026-04-30: 5/5 samples cosine=1.000000 with model forced to nomic-embed-text (v1). Same with nomic-embed-text-v2-moe (both Ollamas have it loaded). Math is provably equivalent across the gateway plumbing.
Drift catalog (reports/cutover/SUMMARY.md):
- URL: Rust /ai/embed vs Go /v1/embed
- Wire: Rust {embeddings, dimensions} (plural) vs Go {vectors, dimension} (singular). Wire-format adapter is the only real cutover work for this endpoint.
- L2 norm: Rust unit vectors (~1.0); Go raw Ollama (~20-23). Same direction (cos=1.0); harmless under cosine-distance HNSW (which is Go vectord's default), but worth fixing in internal/embed/ before extending to euclidean indexes.
reports/cutover/ now tracked (joined the scrum/ + reality-tests/ exemptions in .gitignore).
Next probe: /v1/matrix/retrieve ↔ Rust /vectors/hybrid for the real user-facing retrieve path. Embed parity gives that probe a clean foundation.
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-30 20:07:04 -05:00

4 Commits