lakehouse/scripts at 3a0b37ed93ad263166bbfea009dce9806c21e3c9

profit/lakehouse

root 3a0b37ed93

lakehouse/auditor 1 blocking issue: todo!() macro call in tests/real-world/scrum_master_pipeline.ts

v1: OpenAI-compat alias + smart provider routing — gateway is now drop-in middleware

/v1/chat/completions route alias (same handler as /chat) lets any tool
using the official `openai` SDK adopt the gateway via OPENAI_BASE_URL
alone — no custom provider field needed.

resolve_provider() extended:
- bare `vendor/model` (slash) → openrouter (catches x-ai/grok-4.1-fast,
  moonshotai/kimi-k2, deepseek/deepseek-v4-flash, openai/gpt-oss-120b:free)
- bare vendor model names (no slash, no colon) get auto-prefixed:
  gpt-* / o1-* / o3-* / o4-* → openai/<name>  (OpenRouter form)
  claude-* → anthropic/<name>
  grok-* → x-ai/<name>
  Then routed to openrouter. Ollama models (with colon, no slash) keep
  default routing. Tools like pi-ai validate against an OpenAI-style
  catalog and send bare names — this lets them flow through cleanly.
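
The routing rules above can be sketched as a small pure function. This is an illustration of the rules as the commit message states them, not the gateway's actual resolve_provider(); the name `resolveProviderSketch`, the `Route` type, and the `"default"` label for non-OpenRouter routing are hypothetical. Sending bare names that match no listed prefix to default routing is also an assumption.

```typescript
// Illustrative sketch only: the real resolve_provider() is not shown in the source.
// "default" stands in for whatever the gateway does when no OpenRouter rule applies.
type Route = { provider: "openrouter" | "default"; model: string };

// Bare-name prefixes and the vendor they are rewritten under (from the commit message).
const VENDOR_PREFIXES: Array<[string[], string]> = [
  [["gpt-", "o1-", "o3-", "o4-"], "openai"],
  [["claude-"], "anthropic"],
  [["grok-"], "x-ai"],
];

function resolveProviderSketch(model: string): Route {
  // Rule 1: any `vendor/model` form goes to openrouter unchanged.
  if (model.includes("/")) {
    return { provider: "openrouter", model };
  }
  // Rule 2: bare names (no slash, no colon) get a vendor prefix when one matches.
  if (!model.includes(":")) {
    for (const [prefixes, vendor] of VENDOR_PREFIXES) {
      if (prefixes.some((p) => model.startsWith(p))) {
        return { provider: "openrouter", model: `${vendor}/${model}` };
      }
    }
  }
  // Rule 3: names with a colon and no slash (Ollama style) keep default routing.
  // Assumption: unmatched bare names also fall through to default routing.
  return { provider: "default", model };
}
```

Checking for the slash first means a name such as `openai/gpt-oss-120b:free`, which contains both a slash and a colon, still goes to openrouter, matching the example in the list.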

Verified end-to-end:
- curl POST /v1/chat/completions {model: "gpt-4o-mini", ...} → 200,
  routed to openrouter as openai/gpt-4o-mini
- openai SDK with baseURL=http://localhost:3100/v1 → 3 model variants all
  succeed (openai/gpt-4o-mini, gpt-4o-mini, x-ai/grok-4.1-fast)
- Langfuse traces fire automatically on every call
  (v1.chat:openrouter, provider tagged in metadata)
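
For reference, the curl check above corresponds to a request like the one built below. This is a minimal sketch assuming the standard OpenAI chat-completions body (`model` plus `messages`); the helper name `buildChatRequest` is hypothetical, and whether the gateway expects an Authorization header is not stated in the source, so none is set here.

```typescript
// Hypothetical helper: builds (but does not send) a chat-completions request.
type ChatMessage = { role: "system" | "user" | "assistant"; content: string };

function buildChatRequest(baseUrl: string, model: string, messages: ChatMessage[]) {
  // Strip trailing slashes so "http://host/v1/" and "http://host/v1" behave the same.
  const url = `${baseUrl.replace(/\/+$/, "")}/chat/completions`;
  const init = {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({ model, messages }),
  };
  return { url, init };
}

// Usage against a locally running gateway (port 3100 as in the notes above):
//   const { url, init } = buildChatRequest("http://localhost:3100/v1", "gpt-4o-mini",
//     [{ role: "user", content: "ping" }]);
//   const res = await fetch(url, init);
```

Tools built on the official `openai` SDK should not need this helper, since pointing the SDK's base URL at the gateway is the adoption path described above.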

scripts/mode_pass5_variance_paid.ts gains LH_CONDITIONS env so subset
runs (e.g. just isolation vs composed) take half the latency.
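
One way such an env-driven subset could work is sketched below. The format of LH_CONDITIONS is not given in the source; a comma-separated list of condition names is assumed here, and `selectConditions` is a hypothetical helper rather than code from the script. Only `isolation` and `composed` are condition names taken from the note above.

```typescript
// Hypothetical helper: pick a subset of conditions from a comma-separated string.
// Assumption: an unset or empty value means "run every condition".
function selectConditions(all: readonly string[], raw: string | undefined): string[] {
  if (raw === undefined || raw.trim() === "") {
    return [...all];
  }
  const wanted = raw
    .split(",")
    .map((s) => s.trim())
    .filter((s) => s.length > 0);
  // Fail loudly on typos instead of silently running nothing.
  const unknown = wanted.filter((w) => !all.includes(w));
  if (unknown.length > 0) {
    throw new Error(`unknown conditions: ${unknown.join(", ")}`);
  }
  // Preserve the canonical order of `all`, not the order given in the env var.
  return all.filter((c) => wanted.includes(c));
}

// Usage in a script (condition list is illustrative):
//   const conditions = selectConditions(ALL_CONDITIONS, process.env.LH_CONDITIONS);
```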

Archon-on-Lakehouse integration: gateway side is done. Pi-ai's
openai-responses backend uses /v1/responses (not /chat/completions) and
its openrouter backend appears to bail in client-side validation before
sending. Patching Pi locally to override baseUrl works for arch but the
harness still rejects — needs more work in a follow-up. Direct openai
SDK path (langchain-js / agents / patched Pi) works today.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

2026-04-26 17:49:37 -05:00

| File | Last commit | Date |
| --- | --- | --- |
| ab_t3_test.sh | qwen3.5 executor + continuation primitive + think:false | 2026-04-20 20:19:02 -05:00 |
| analyze_chicago_contracts.ts | scripts: chicago analyzer field-name fixes + vectorize sanitizer hardening | 2026-04-25 19:34:45 -05:00 |
| autonomous_agent.py | Fix: job tracker field name mismatch — the overnight killer | 2026-04-17 10:41:32 -05:00 |
| build_lakehouse_corpus.ts | v1/mode: model-aware enrichment downgrade + 3 corpora + variance harness | 2026-04-26 17:29:17 -05:00 |
| build_scrum_findings_corpus.ts | v1/mode: model-aware enrichment downgrade + 3 corpora + variance harness | 2026-04-26 17:29:17 -05:00 |
| build_symbols_corpus.ts | v1/mode: model-aware enrichment downgrade + 3 corpora + variance harness | 2026-04-26 17:29:17 -05:00 |
| copilot.py | Staffing Co-Pilot — the anticipation layer that changes everything | 2026-04-17 00:19:07 -05:00 |
| dump_raw_corpus.sh | raw-corpus dump + vectorization + chicago contract inference pipeline | 2026-04-25 18:44:27 -05:00 |
| generate_demo.py | Phase 6: Ingest pipeline — CSV, JSON, PDF, text file support | 2026-03-27 08:07:31 -05:00 |
| generate_workers.py | MCP server (Bun) + 100K worker generator + lakehouse integration | 2026-04-16 23:54:33 -05:00 |
| kb_measure.py | Item 3 — geo-filtered playbook boost; diagnostic logging | 2026-04-20 21:35:04 -05:00 |
| kb_staffer_report.py | Phase 23 — contract terms + staffer identity + competence-weighted retrieval | 2026-04-20 22:16:09 -05:00 |
| lance_tune.py | IVF_PQ recall tuned from 0.80 → 0.97 via parameter sweep | 2026-04-16 22:08:34 -05:00 |
| mode_compare.ts | v1/mode: model-aware enrichment downgrade + 3 corpora + variance harness | 2026-04-26 17:29:17 -05:00 |
| mode_experiment.ts | v1/mode: model-aware enrichment downgrade + 3 corpora + variance harness | 2026-04-26 17:29:17 -05:00 |
| mode_pass2_corpus_sweep.ts | v1/mode: override knobs + staffing native runner + pass 2/3/4 harnesses | 2026-04-26 01:55:12 -05:00 |
| mode_pass3_variance.ts | v1/mode: override knobs + staffing native runner + pass 2/3/4 harnesses | 2026-04-26 01:55:12 -05:00 |
| mode_pass4_staffing.ts | v1/mode: override knobs + staffing native runner + pass 2/3/4 harnesses | 2026-04-26 01:55:12 -05:00 |
| mode_pass5_summarize.ts | v1/mode: model-aware enrichment downgrade + 3 corpora + variance harness | 2026-04-26 17:29:17 -05:00 |
| mode_pass5_variance_paid.ts | v1: OpenAI-compat alias + smart provider routing — gateway is now drop-in middleware | 2026-04-26 17:49:37 -05:00 |
| overnight_proof.sh | Fix: job tracker field name mismatch — the overnight killer | 2026-04-17 10:41:32 -05:00 |
| quality_eval.py | Quality evaluation pipeline — tests correctness, not just structure | 2026-04-16 22:14:06 -05:00 |
| qwen3_plan.py | Qwen 3 integration + agent plan + playbook loop | 2026-04-17 00:08:48 -05:00 |
| run_kb_batch.sh | Lift k cap, drop ornamental reason field, scenario generator | 2026-04-20 20:31:34 -05:00 |
| run_staffer_demo.sh | Phase 23 — contract terms + staffer identity + competence-weighted retrieval | 2026-04-20 22:16:09 -05:00 |
| scale_10m_test.sh | 10M vector scale test — cron heartbeat, runs while J sleeps | 2026-04-17 01:06:38 -05:00 |
| scale_test.py | Scale test: 2.47M rows + 10K vector index benchmarked | 2026-03-27 08:31:37 -05:00 |
| seal_agent_playbook.ts | pathway_memory: Mem0 versioning + deletion (upsert/revise/retire/history) | 2026-04-25 19:31:44 -05:00 |
| serve_imagegen.py | Ingest Ethereal 10K worker profiles — domain data in the substrate | 2026-04-16 22:26:19 -05:00 |
| serve_lab.py | Ingest Ethereal 10K worker profiles — domain data in the substrate | 2026-04-16 22:26:19 -05:00 |
| serve_ui.py | Systemd services: gateway, sidecar, UI survive reboots | 2026-03-27 22:06:28 -05:00 |
| staffing_day.py | Staffing day simulation: 94% pass, all gates clear, ready for batching | 2026-04-17 00:14:34 -05:00 |
| staffing_demo.py | PRD v2: production roadmap with ingest, vector search, hot cache phases | 2026-03-27 07:54:24 -05:00 |
| staffing_simulation.py | Fix staffing simulation verifier + clean regression: 0 hallucinations | 2026-04-16 23:28:54 -05:00 |
| stress_test.py | Stress test suite: 9/9 passed — architecture validated | 2026-03-27 22:13:27 -05:00 |
| vectorize_raw_corpus.ts | scripts: chicago analyzer field-name fixes + vectorize sanitizer hardening | 2026-04-25 19:34:45 -05:00 |