golangLAKEHOUSE

profit/golangLAKEHOUSE

Fork 0

Commit Graph

Author	SHA1	Message	Date
root	68d9e554b0	shared: auto-emit Langfuse trace+span per HTTP request — closes OPEN #2 Adds langfuseMiddleware in internal/shared so every daemon's shared.Run gets free production-traffic trace visibility when LANGFUSE_URL + LANGFUSE_PUBLIC_KEY + LANGFUSE_SECRET_KEY are set. Same env names + file shape as the multi_coord_stress driver, so operators ship one /etc/lakehouse/langfuse.env across the deploy. Wiring is auth-gated: middleware runs INSIDE the RequireAuth group, so 401s from credential-stuffing don't pollute traces. /health is exempt so LB probes don't either. Missing env vars → nil client → middleware is a passthrough no-op (fail-open per ADR-005 5.1). Bundled deploy: - langfuse.env.example template (mode 0640, root:lakehouse) - 11 systemd units gain `EnvironmentFile=-/etc/lakehouse/langfuse.env` (leading - so missing file = OK) - REPLICATION.md bootstrap section documents setup Tests (4): nil passthrough, /health bypass, real-request emission, status-writer wrapping. All green. STATE_OF_PLAY OPEN list: 5 rows → 4 rows. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-30 19:55:42 -05:00
root	54a05d9311	Sprint 4 deployment artifacts: Dockerfile + docker-compose Parallel deploy target to the systemd units that landed in a59ef5b. Single image carries all 11 daemons; docker-compose runs one container per daemon with the same dependency graph as the systemd units. Useful when systemd isn't available (Mac dev, remote VMs without root) or when isolation to a private docker network is preferred. Dockerfile (multi-stage): - Builder: golang:1.25-bookworm. DuckDB cgo needs gcc + glibc; alpine's musl doesn't link the official duckdb-go bindings cleanly. - Runtime: debian:bookworm-slim — same libc, much smaller surface. Adds ca-certificates (outbound HTTPS to OpenRouter/OpenCode/Kimi), curl + jq (in-container healthchecks + smoke probes), tini (PID 1 signal forwarding so docker stop sends SIGTERM to the daemon, not to a wrapper). - Single image, multiple binaries. Ships all 11 cmd/* + 3 scripts/ (staffing_workers, playbook_lift, multi_coord_stress) so deployed stacks can run reality tests against themselves. - Non-root runtime user (uid 999 lakehouse). Layout matches /usr/local/bin/lakehouse/<daemon> from REPLICATION.md. - ENTRYPOINT=tini; no default CMD — operators / compose pick which daemon explicitly. docker-compose.yml (11 services): - Same dependency graph as deploy/systemd/. depends_on with service_healthy condition matches Requires= equivalents: catalogd → storaged ingestd → storaged + catalogd queryd → catalogd matrixd → embedd + vectord - Gateway uses bare depends_on (no health condition) — Wants= equivalent so single-upstream restart doesn't cascade. - chatd has per-provider env_file entries (one each for ollama_cloud, openrouter, opencode, kimi) — missing files are silently OK, matching the systemd unit's EnvironmentFile=- list. - Persistent state on the lakehouse-state named volume; commented driver_opts shows how to bind to a host path for off-volume backups. .dockerignore: - Excludes bin/ + reports/ + data/ + git metadata + .env files. - Especially excludes lakehouse.toml/secrets-go.toml/auth.env so local dev configs don't accidentally bake into a published image. REPLICATION.md gains a Docker section between systemd setup and the logs section. Ten-line copy-paste from "git clone" to "docker compose up -d", plus a docker-vs-systemd differences table covering process supervision, logs, restart policy, file ownership, host networking quirks, and backup targets. Validation: docker compose config --quiet → exit 0 (with placeholder env files in place). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-30 18:58:47 -05:00
root	a59ef5b930	Sprint 4 deployment artifacts: 11 systemd units + REPLICATION.md + env templates Builds on ADR-006 to ship the operator-facing bits Sprint 4 was blocked on. Single-host deploy is now a documented procedure. deploy/systemd/ (12 files): - 11 .service units, one per daemon. Each follows the same template: Type=simple, User=lakehouse, hardening (NoNewPrivileges, ProtectSystem=strict, ProtectHome, PrivateTmp, ReadWritePaths scoped to /var/lib/lakehouse + /var/log/lakehouse), JSON to journald with per-daemon SyslogIdentifier, EnvironmentFile=- on /etc/lakehouse/auth.env. - Dependency graph baked in via After=/Requires=: storaged → standalone (only network-online) catalogd → Requires storaged ingestd → Requires storaged + catalogd queryd → Requires catalogd matrixd → Requires embedd + vectord gateway → Wants every other daemon (Wants= not Requires= so a single upstream restart doesn't cascade-restart the gateway) pathwayd / observerd / vectord / embedd / chatd → standalone - chatd unit reads 4 cloud-provider EnvironmentFile=s (ollama_cloud / openrouter / opencode / kimi) — each is its own file so per-provider key rotation doesn't restart the others. - lakehouse-go.target: convenience aggregator. Operators systemctl start/stop/enable lakehouse-go.target instead of managing 11 daemons individually. Per-daemon WantedBy= this target. deploy/etc-lakehouse/ (2 templates): - auth.env.example: AUTH_TOKEN per ADR-006 6.2 + rotation playbook comments. The committed file is empty — operators copy + fill in. - secrets-go.toml.example: [s3.primary] template with REPLACE_ME placeholders. Multi-bucket G2 example commented. REPLICATION.md (top-level): - Operator runbook from fresh box → 11 daemons running. - Prereqs (Go 1.25+, gcc, MinIO, Ollama, optionally Langfuse + Postgres for Langfuse) with reachability checks. - Bind ports table (3110–3220, shifted by 10 from Rust legacy). - Bootstrap: useradd → build → install → config → secrets → systemd → validation. - Auth posture matrix (loopback / non-loopback / multi-host / TLS). - Token rotation procedure inline (ADR-006 Decision 6.5). - Logs (journalctl), backup paths, troubleshooting matrix. Validation: systemd-analyze verify passed on all 11 .service files (only "not executable" warnings, expected since binaries don't live at /usr/local/bin/lakehouse/ until step 2 of bootstrap runs). Sprint 4 is now operator-ready. Next: Dockerfile + multi-stage build for container deploys (separate concern; deploy targets either systemd OR docker, not both). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-30 18:54:49 -05:00

Author

SHA1

Message

Date

root

68d9e554b0

shared: auto-emit Langfuse trace+span per HTTP request — closes OPEN #2

Adds langfuseMiddleware in internal/shared so every daemon's
shared.Run gets free production-traffic trace visibility when
LANGFUSE_URL + LANGFUSE_PUBLIC_KEY + LANGFUSE_SECRET_KEY are set.
Same env names + file shape as the multi_coord_stress driver, so
operators ship one /etc/lakehouse/langfuse.env across the deploy.

Wiring is auth-gated: middleware runs INSIDE the RequireAuth group,
so 401s from credential-stuffing don't pollute traces. /health is
exempt so LB probes don't either. Missing env vars → nil client →
middleware is a passthrough no-op (fail-open per ADR-005 5.1).

Bundled deploy:
- langfuse.env.example template (mode 0640, root:lakehouse)
- 11 systemd units gain `EnvironmentFile=-/etc/lakehouse/langfuse.env`
  (leading - so missing file = OK)
- REPLICATION.md bootstrap section documents setup

Tests (4): nil passthrough, /health bypass, real-request emission,
status-writer wrapping. All green.

STATE_OF_PLAY OPEN list: 5 rows → 4 rows.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

2026-04-30 19:55:42 -05:00

root

54a05d9311

Sprint 4 deployment artifacts: Dockerfile + docker-compose

Parallel deploy target to the systemd units that landed in a59ef5b.
Single image carries all 11 daemons; docker-compose runs one
container per daemon with the same dependency graph as the systemd
units. Useful when systemd isn't available (Mac dev, remote VMs
without root) or when isolation to a private docker network is
preferred.

Dockerfile (multi-stage):
- Builder: golang:1.25-bookworm. DuckDB cgo needs gcc + glibc;
  alpine's musl doesn't link the official duckdb-go bindings cleanly.
- Runtime: debian:bookworm-slim — same libc, much smaller surface.
  Adds ca-certificates (outbound HTTPS to OpenRouter/OpenCode/Kimi),
  curl + jq (in-container healthchecks + smoke probes), tini (PID 1
  signal forwarding so docker stop sends SIGTERM to the daemon, not
  to a wrapper).
- Single image, multiple binaries. Ships all 11 cmd/* + 3 scripts/
  (staffing_workers, playbook_lift, multi_coord_stress) so deployed
  stacks can run reality tests against themselves.
- Non-root runtime user (uid 999 lakehouse). Layout matches
  /usr/local/bin/lakehouse/<daemon> from REPLICATION.md.
- ENTRYPOINT=tini; no default CMD — operators / compose pick
  which daemon explicitly.

docker-compose.yml (11 services):
- Same dependency graph as deploy/systemd/. depends_on with
  service_healthy condition matches Requires= equivalents:
    catalogd → storaged
    ingestd → storaged + catalogd
    queryd → catalogd
    matrixd → embedd + vectord
- Gateway uses bare depends_on (no health condition) — Wants=
  equivalent so single-upstream restart doesn't cascade.
- chatd has per-provider env_file entries (one each for
  ollama_cloud, openrouter, opencode, kimi) — missing files are
  silently OK, matching the systemd unit's EnvironmentFile=- list.
- Persistent state on the lakehouse-state named volume; commented
  driver_opts shows how to bind to a host path for off-volume
  backups.

.dockerignore:
- Excludes bin/ + reports/ + data/ + git metadata + .env files.
- Especially excludes lakehouse.toml/secrets-go.toml/auth.env so
  local dev configs don't accidentally bake into a published image.

REPLICATION.md gains a Docker section between systemd setup and
the logs section. Ten-line copy-paste from "git clone" to
"docker compose up -d", plus a docker-vs-systemd differences
table covering process supervision, logs, restart policy, file
ownership, host networking quirks, and backup targets.

Validation: docker compose config --quiet → exit 0 (with
placeholder env files in place).

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

2026-04-30 18:58:47 -05:00

root

a59ef5b930

Sprint 4 deployment artifacts: 11 systemd units + REPLICATION.md + env templates

Builds on ADR-006 to ship the operator-facing bits Sprint 4 was
blocked on. Single-host deploy is now a documented procedure.

deploy/systemd/ (12 files):
- 11 .service units, one per daemon. Each follows the same template:
  Type=simple, User=lakehouse, hardening (NoNewPrivileges,
  ProtectSystem=strict, ProtectHome, PrivateTmp, ReadWritePaths
  scoped to /var/lib/lakehouse + /var/log/lakehouse), JSON to
  journald with per-daemon SyslogIdentifier, EnvironmentFile=- on
  /etc/lakehouse/auth.env.
- Dependency graph baked in via After=/Requires=:
    storaged → standalone (only network-online)
    catalogd → Requires storaged
    ingestd → Requires storaged + catalogd
    queryd → Requires catalogd
    matrixd → Requires embedd + vectord
    gateway → Wants every other daemon (Wants= not Requires=
              so a single upstream restart doesn't cascade-restart
              the gateway)
    pathwayd / observerd / vectord / embedd / chatd → standalone
- chatd unit reads 4 cloud-provider EnvironmentFile=s
  (ollama_cloud / openrouter / opencode / kimi) — each is its own
  file so per-provider key rotation doesn't restart the others.
- lakehouse-go.target: convenience aggregator. Operators
  systemctl start/stop/enable lakehouse-go.target instead of
  managing 11 daemons individually. Per-daemon WantedBy=
  this target.

deploy/etc-lakehouse/ (2 templates):
- auth.env.example: AUTH_TOKEN per ADR-006 6.2 + rotation playbook
  comments. The committed file is empty — operators copy + fill in.
- secrets-go.toml.example: [s3.primary] template with
  REPLACE_ME placeholders. Multi-bucket G2 example commented.

REPLICATION.md (top-level):
- Operator runbook from fresh box → 11 daemons running.
- Prereqs (Go 1.25+, gcc, MinIO, Ollama, optionally Langfuse +
  Postgres for Langfuse) with reachability checks.
- Bind ports table (3110–3220, shifted by 10 from Rust legacy).
- Bootstrap: useradd → build → install → config → secrets →
  systemd → validation.
- Auth posture matrix (loopback / non-loopback / multi-host / TLS).
- Token rotation procedure inline (ADR-006 Decision 6.5).
- Logs (journalctl), backup paths, troubleshooting matrix.

Validation: systemd-analyze verify passed on all 11 .service files
(only "not executable" warnings, expected since binaries don't live
at /usr/local/bin/lakehouse/ until step 2 of bootstrap runs).

Sprint 4 is now operator-ready. Next: Dockerfile + multi-stage
build for container deploys (separate concern; deploy targets
either systemd OR docker, not both).

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

2026-04-30 18:54:49 -05:00

3 Commits