lakehouse/scenarios at 6d7b251607c6a92742f4e9906482a6a67fe9be9e - lakehouse - Gitea: Git with a cup of tea

profit/lakehouse

History

root 5e89407939 Phase 23 refinement — per-staffer tool_level variance

Staffer.tool_level now controls which subsystems a specific run gets:

  full     — qwen3.5 + qwen3 + cloud T3 + cloud rescue
  local    — qwen3.5 + qwen3 + local gpt-oss:20b T3 + rescue
  basic    — qwen2.5 + qwen2.5 + local T3, no rescue
  minimal  — qwen2.5 + qwen2.5, NO T3, NO rescue. Playbook
             inheritance only.

applyToolLevel() mutates module-scoped ACTIVE_* slots each run from the
env defaults, so prior staffer's overrides never leak. Hot-path code
reads ACTIVE_EXECUTOR / ACTIVE_REVIEWER / ACTIVE_T3_DISABLED /
ACTIVE_OVERVIEW_CLOUD / ACTIVE_RETRY_ON_FAIL instead of the baked
constants.

The architectural question this answers: does playbook_memory
inheritance carry enough knowledge to let a weakly-tooled coordinator
still produce usable outcomes? "Minimal" Alex runs qwen2.5 exec + no
reviewer overseer + no cloud rescue. If Alex still fills events at a
reasonable rate, the playbook system is the real knowledge carrier —
the senior stack is nice-to-have, not the sine qua non.

Demo personas mapped:
  Maria (senior, 48mo, full)
  James (mid, 14mo, local)
  Sam (junior, 4mo, basic)
  Alex (trainee, 1mo, minimal)

Same 3 contracts (Nashville downtown, Joliet warehouse, Indianapolis
assembly) across all four → 12 runs. KB + kb_staffer_report.py
leaderboard already wired; competence_score will now reflect real tool
asymmetry instead of LLM sampling variance.

2026-04-20 22:50:05 -05:00

..

Phase 23 refinement — per-staffer tool_level variance

2026-04-20 22:50:05 -05:00

manifest.json

Phase 23 — contract terms + staffer identity + competence-weighted retrieval

2026-04-20 22:16:09 -05:00

nashville_contract.json

Phase 23 — contract terms + staffer identity + competence-weighted retrieval

2026-04-20 22:16:09 -05:00

scen_000_Great_Lakes_Mfg_Cincinnati.json

Phase 23 — contract terms + staffer identity + competence-weighted retrieval

2026-04-20 22:16:09 -05:00

scen_001_Parallel_Machining_Joliet.json

Phase 23 — contract terms + staffer identity + competence-weighted retrieval

2026-04-20 22:16:09 -05:00

scen_002_Summit_Industrial_Cincinnati.json

Phase 23 — contract terms + staffer identity + competence-weighted retrieval

2026-04-20 22:16:09 -05:00

scen_003_Pioneer_Assembly_Chicago.json

Phase 23 — contract terms + staffer identity + competence-weighted retrieval

2026-04-20 22:16:09 -05:00

scen_004_Midway_Distribution_Columbus.json

Phase 23 — contract terms + staffer identity + competence-weighted retrieval

2026-04-20 22:16:09 -05:00

scen_005_Apex_Warehouse_Cleveland.json

Phase 23 — contract terms + staffer identity + competence-weighted retrieval

2026-04-20 22:16:09 -05:00

scen_006_Pioneer_Assembly_Flint.json

Phase 23 — contract terms + staffer identity + competence-weighted retrieval

2026-04-20 22:16:09 -05:00

scen_007_Riverfront_Steel_Toledo.json

Phase 23 — contract terms + staffer identity + competence-weighted retrieval

2026-04-20 22:16:09 -05:00

scen_008_Northland_Logistics_Indianapolis.json

Phase 23 — contract terms + staffer identity + competence-weighted retrieval

2026-04-20 22:16:09 -05:00

scen_009_Parallel_Machining_Flint.json

Phase 23 — contract terms + staffer identity + competence-weighted retrieval

2026-04-20 22:16:09 -05:00

scen_010_Northland_Logistics_Chicago.json

Phase 23 — contract terms + staffer identity + competence-weighted retrieval

2026-04-20 22:16:09 -05:00

scen_011_Heritage_Foods_Flint.json

Phase 23 — contract terms + staffer identity + competence-weighted retrieval

2026-04-20 22:16:09 -05:00

scen_012_Parallel_Machining_Kansas_City.json

Phase 23 — contract terms + staffer identity + competence-weighted retrieval

2026-04-20 22:16:09 -05:00

scen_013_Horizon_Supply_Flint.json

Phase 23 — contract terms + staffer identity + competence-weighted retrieval

2026-04-20 22:16:09 -05:00

scen_014_Midway_Distribution_Indianapolis.json

Phase 23 — contract terms + staffer identity + competence-weighted retrieval

2026-04-20 22:16:09 -05:00

scen_015_Cornerstone_Fabrication_Kansas_City.json

Phase 23 — contract terms + staffer identity + competence-weighted retrieval

2026-04-20 22:16:09 -05:00

scen_016_Riverfront_Steel_Columbus.json

Phase 23 — contract terms + staffer identity + competence-weighted retrieval

2026-04-20 22:16:09 -05:00

scen_017_Summit_Industrial_Detroit.json

Phase 23 — contract terms + staffer identity + competence-weighted retrieval

2026-04-20 22:16:09 -05:00

scen_018_Heritage_Foods_Cincinnati.json

Phase 23 — contract terms + staffer identity + competence-weighted retrieval

2026-04-20 22:16:09 -05:00

scen_019_Midway_Distribution_Chicago.json

Phase 23 — contract terms + staffer identity + competence-weighted retrieval

2026-04-20 22:16:09 -05:00

stress_01.json

Item A — stress scenario + enriched T3 diagnostic prompt

2026-04-20 21:54:29 -05:00