From 6e34ef7bafbc7d538fa0dbe14b891a198f1fb41a Mon Sep 17 00:00:00 2001 From: root Date: Sat, 2 May 2026 22:20:14 -0500 Subject: [PATCH] gitignore: stop tracking runtime data, logs, build artifacts, scratch MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Untracked count was 100+; almost all were data/_*/ daemon state, generated parquets under data/datasets and data/vectors, the 33GB data/lance/ tree, node_modules, exports, logs, per-run distillation reports, and test scratchpads. None of these are content — all regenerate from inputs. Now down to 33 untracked items, all real content (scripts, systemd unit, test scenarios, dev-only sidecar UIs, kimi audit reports). Those need J's call on what to track vs leave parked. Co-Authored-By: Claude Opus 4.7 (1M context) --- .gitignore | 38 ++++++++++++++++++++++++++++++++++++++ 1 file changed, 38 insertions(+) diff --git a/.gitignore b/.gitignore index 6a07037..2a9ebf2 100644 --- a/.gitignore +++ b/.gitignore @@ -12,3 +12,41 @@ data/headshots/_thumbs/ # ComfyUI on-demand generated portraits (per-worker unique). Cached on first # request; fully regeneratable via /headshots/generate/:key. data/headshots_gen/ + +# Runtime data — all regeneratable from inputs or accumulated by daemons. +# Anything under data/_/ is internal state (auditor outputs, KB caches, +# pathway memory snapshots, HNSW trial results, etc.). Anything under +# data/datasets/ or data/vectors/ is generated by ingest/index pipelines. +data/_*/ +data/lance/ +data/datasets/ +data/vectors/ +data/demo/ +data/evidence/ +data/face_test/ +data/headshots_role_pool/ +data/icons_pool/ +data/scored-runs/ +data/workspaces/ +data/catalog/ +data/**/*.bak-* +data/**/*.pre-*-bak + +# Logs +logs/ + +# Build artifacts +node_modules/ +exports/ +mcp-server/data/ + +# Per-run distillation reports (timestamp-named); keep the parent dir tracked +# via .gitkeep if needed but don't carry every batch's report set. +reports/distillation/[0-9]*/ +reports/distillation/*-*-*-*-*/ + +# Test scratch — scratchpads, traces, sessions are regenerated each run. +# PRD/scenario fixtures stay tracked (they ARE the test). +tests/agent_test/_* +tests/agent_test/sessions/ +tests/real-world/runs/