lakehouse/docs at 8ec43e07217ceba1cd103069e4d433a7e193350c

profit/lakehouse

root 8ec43e0721 phase 1.6 Gate 3b: deepface integration design doc (3 options + recommendation)

Per docs/PHASE_1_6_BIPA_GATES.md Gate 3b. Three viable paths for
populating BiometricCollection.classifications, each sized with its
tradeoffs:

  Option A — Python subprocess per upload (no daemon)
    ~80 LOC, 0.5-1 day. Smallest integration. Reintroduces a Python
    dependency the 2026-05-02 sidecar drop deliberately removed.

  Option B — ONNX models in Rust (no Python at all)
    ~200-400 LOC + model-build pipeline, 5-7 days. Fully consistent
    with the sidecar drop. Needs pre-trained models with appropriate
    licenses (or training our own, which is multi-week). Adds face
    detection preprocessing in Rust.

  Option C — Defer; classifications field stays None
    0.25 day. BIPA-safest position; substrate is forward-compatible.
    Forces the question "do we actually need classifications?" to be
    answered by a real product requirement, not by spec inertia.

Recommendation: **Option C (defer)**, conditional on confirming the
product requirement. Reasoning:
- All BIPA-load-bearing surfaces (consent + audit + retention +
  erasure) ship without classifications
- Riskiest BIPA position is collecting demographic-derived data
  without a documented business purpose
- Substrate accommodates A or B later in 1-3 days if real demand
  surfaces

Open questions for J are at the bottom of the doc. Choosing A, B, or C
is the gating decision before any engineering happens.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

2026-05-03 05:25:45 -05:00

phase 1.6 BIPA: scrum-driven fixes

2026-05-03 04:43:17 -05:00

distillation: Phase 9 — release freeze and operator handoff

2026-04-26 23:54:31 -05:00

policies/consent

phase 1.6 BIPA gates — engineering wave (4 of 7 staged)

2026-05-03 04:38:49 -05:00

staffing: recon + synthetic-data gap report (Phase 0, no implementation)

2026-04-27 00:02:47 -05:00

phase 1.6 BIPA gates — engineering wave (4 of 7 staged)

2026-05-03 04:38:49 -05:00

phase 1.6 Gate 3b: deepface integration design doc (3 options + recommendation)

2026-05-03 05:25:45 -05:00

ADR-017-federation.md

Federation foundation + HNSW trial system + Postgres streaming + PRD reframe

2026-04-16 01:50:05 -05:00

ADR-019-vector-storage.md

docs: sync ADR-019 + PRD + DECISIONS with 2026-05-02 substrate changes

2026-05-03 00:44:57 -05:00

ADR-020-universal-id-mapping.md

ADR-020: Universal ID mapping — fix the flat embedding identity problem

2026-04-17 11:58:18 -05:00

ADR-021-sparse-data-trust.md

ADR-021: Sparse data trust path — start with nothing, earn everything

2026-04-17 15:32:06 -05:00

ARCHITECTURE_COMPARISON.md

docs: pointer to ARCHITECTURE_COMPARISON.md source in golangLAKEHOUSE

2026-05-01 04:57:09 -05:00

AUDIT_PHASE_1_5_BIPA_AND_OUTCOMES.md

audit phase 1.5: BIPA schema audit + outcomes.jsonl content sample

2026-05-03 01:22:53 -05:00

AUDIT_PHASE_1_DISCOVERY.md

audit phase 1: §10 scrum-review findings + walk back §1F over-claim

2026-05-03 01:13:07 -05:00

AUDIT_TRAIL_PRD.md

audit docs: deprecation headers — over-scoped for local-only deployment

2026-05-03 02:42:05 -05:00

AUDITOR_CONTEXT.md

Audit pipeline PR #9: determinism + fact extraction + verifier gate + KB stats + context injection (PR #9)

2026-04-23 05:29:38 +00:00

CONTROL_PLANE_PRD.md

Phase 45 (first slice): DocRef + doc_refs field on PlaybookEntry

2026-04-22 03:14:07 -05:00

DECISIONS.md

docs: sync ADR-019 + PRD + DECISIONS with 2026-05-02 substrate changes

2026-05-03 00:44:57 -05:00

EXECUTION_PLAN.md

Federation foundation + HNSW trial system + Postgres streaming + PRD reframe

2026-04-16 01:50:05 -05:00

IDENTITY_SERVICE_DESIGN.md

audit docs: deprecation headers — over-scoped for local-only deployment

2026-05-03 02:42:05 -05:00

MATRIX_AGENT_HANDOVER.md

docs: add MATRIX_AGENT_HANDOVER notes + cross-link from SCRUM_MASTER_SPEC

2026-04-25 23:54:42 -05:00

MODE_RUNNER_TUNING_PLAN.md

v1/mode: model-aware enrichment downgrade + 3 corpora + variance harness

2026-04-26 17:29:17 -05:00

PHASE_1_6_BIPA_GATES.md

phase 1.6 Gate 3a: photo upload endpoint with consent gate

2026-05-03 04:55:32 -05:00

PHASE_AUDIT_GUIDE.md

chore: add real content that was sitting untracked

2026-05-02 22:22:10 -05:00

PHASES.md

docs: PHASES tracker — mark Phases 42/43/44/45 complete

2026-04-27 08:03:40 -05:00

PRD.md

docs: AUDIT_TRAIL_PRD — production-readiness gate for staffing client

2026-05-03 00:54:46 -05:00

SCRUM_FIX_WAVE.md

Scrum-driven fixes: P5-001 auth wired, P42-001 truth evaluator, P9-001 journal on ingest

2026-04-24 02:25:43 -05:00

SCRUM_FORENSIC_PROMPT.md

Scrum-driven fixes: P5-001 auth wired, P42-001 truth evaluator, P9-001 journal on ingest

2026-04-24 02:25:43 -05:00

SCRUM_LOOP_NOTES.md

docs: rewrite PR #10 description to drop unfalsifiable metric claims

2026-04-24 03:02:21 -05:00

SCRUM_MASTER_SPEC.md

docs: SCRUM_MASTER_SPEC timeline — productization wave + verified live state

2026-04-26 20:50:05 -05:00

SYSTEM_EVOLUTION_LAYERS.md

Scrum-driven fixes: P5-001 auth wired, P42-001 truth evaluator, P9-001 journal on ingest

2026-04-24 02:25:43 -05:00