lakehouse

Author	SHA1	Message	Date
root	f892230699	demo: search.html UX polish — skeleton loader, card-in stagger, hero takeover, B&W faces Search results no longer pop in as a single block. New behavior: - Skeleton list pre-claims the vertical space results will occupy with shimmering placeholder cards, so arriving results fade in over the skeleton instead of pushing layout. Sweep is staggered per row for a "rolling wave" not "everything blinking together". - Domain-language stage caption ("matching against permits", "ranking by reliability") rotates on a fixed schedule so users read progress, not a stuck spinner. - @keyframes card-in: real worker cards rise 4px and fade in over 350ms with nth-child stagger across the first ~12 rows. Honors prefers-reduced-motion. - Avatar imgs filter through grayscale + slight contrast/blur to pull the SDXL Turbo color cast (which screams "AI generated" at small sizes). Cert icons get the same treatment. - Once-per-session hero takeover compresses the Section ⓪ strip ("Not a CRM — an index that learns from you") into a centered hero on first paint, dismissed by clicking anywhere. Stats hydrate from live endpoints. console.html: mirrors the avatar B&W filter for visual consistency, and removes the headshot insertion entirely — back to monogram initials. The console (internal staffer view) doesn't need synthetic faces; the public demo at /lakehouse/ does. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-28 06:01:04 -05:00
root	f9a408e4c4	Surname → ethnicity routing + ComfyUI fallback for sparse pool buckets + cache-buster Three problems J flagged ("not matching properly", "same faces", "still showing old icons") had three different roots: 1. MISMATCH: front-end was first-name only, so "Anna Cruz" / "Patricia Garcia" / "John Jimenez" all defaulted to caucasian. Added SURNAMES_HISPANIC / _SOUTH_ASIAN / _EAST_ASIAN / _MIDDLE_EASTERN dicts to both search.html and console.html. Surname is checked FIRST (stronger signal for hispanic + asian than first names), then first-name fallback. Cruz → hispanic, Patel → south_asian, Nguyen → east_asian, regardless of first name. 2. SAME FACES: pool buckets are uneven — woman/south_asian=3, man/black=4, woman/middle_eastern=2 — so any worker in those buckets collapses to 2-4 photos no matter how good the hash is. /headshots/:key now 302-redirects to /headshots/generate/:key when the gender × race intersection is below 30 faces. ComfyUI on-demand gives infinite uniqueness for the sparse buckets (deterministic-per-worker via djb2 seed). Dense buckets still serve from the pool — no GPU cost there. 3. STALE CACHE: Cache-Control was max-age=86400, immutable — pinned old photos in browsers for 24h after any server-side update. Dropped to max-age=3600, must-revalidate, and added a v=2 cache-buster query param to all front-end /headshots/ URLs so existing cached entries are bypassed on next page load. Also surfacing X-Face-Pool-Bucket / Bucket-Size headers for diagnosis. Verified: playwright run shows surname routing correct (Torres, Rivera, Alvarez, Gutierrez, Patel, Nguyen, Omar all bucketed correctly), sparse buckets 302 to ComfyUI, dense buckets stay on the thumb pool. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-28 06:01:04 -05:00
root	a3b65f314e	Synthetic face pool — 1000 StyleGAN headshots, ComfyUI hot-swap, 60x smaller thumbs Worker cards now ship a real photo per person instead of monogram tiles: - fetch_face_pool.py pulls 1000 faces from thispersondoesnotexist.com - tag_face_pool.py runs deepface for gender/race/age, excludes <22yo - manifest.jsonl: 952 servable, gender/race buckets populated - /headshots/_thumbs/ pre-resized to 384px webp (587KB -> 11KB, 60x smaller; without this Chrome's parallel-connection budget drops ~75% of tiles in a 40-card grid) - /headshots/:key gender x race x age intersection bucketing with gender-only fallback when intersection is sparse - /headshots/generate/:key ComfyUI on-demand for the contractor profile spotlight (cold ~1.5s, cached ~1ms; worker-derived djb2 seed makes faces deterministic-per-worker but unique across workers sharing the same prompt) - serve_imagegen.py _cache_key() now includes seed (was caching by prompt only -> 3 different worker seeds collapsed to 1 cached image; verified fix produces 3 distinct md5s) - confidence-default name resolution: Xavier->man+hispanic, Aisha->woman+black, etc. Every worker resolves to a bucket. End-to-end: playwright run on /?q=forklift+operators+IL -> 21/21 cards loaded, 0 broken, all 384px webp. Cache + binary pool gitignored; manifest tracked. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-28 06:01:04 -05:00
root	10ed3bc630	demo: real synthetic headshots — fetch pool + serve route + UI wire Three layers shipped: 1. SCRIPT — scripts/staffing/fetch_face_pool.py Pulls N synthetic StyleGAN faces from thispersondoesnotexist.com into data/headshots/face_NNNN.jpg, writes manifest.jsonl. Idempotent: re-running skips existing files. Optional gender tagging via deepface (currently unavailable on this box; the script handles ImportError gracefully and tags everything as untagged). Fetched 198 faces with concurrency=3 in ~67s. 2. SERVER — /headshots/:key route in mcp-server/index.ts Loads manifest at first hit, caches in globalThis._faces. Hashes the key with djb2-style mixing → pool index → returns the JPG. Same key always gets the same face (deterministic). Accepts ?g=man\|woman&e=caucasian\|black\|hispanic\|south_asian\|east_asian\|middle_eastern to bias pool selection — the gender/ethnicity buckets fall back to the full pool when no tagged matches exist. Cache-Control: 86400 immutable so faces ride the browser cache after first hit. /headshots/__reload re-reads the manifest without restart. 3. UI — search.html + console.html worker cards Re-added overlay <img> on top of the monogram .av circle. img.src = /headshots/<encoded-key>?g=<hint>&e=<hint>. img.onerror removes the failed image so the monogram stays visible if the face pool isn't fetched / CDN is blocked. .av now has overflow:hidden + position:relative to clip the img to a perfect circle. Forced-confident name resolution (J: "we're CREATING the profile, created as though you truly have the information Xavier is more likely Hispanic and he's a male"): genderFor(name) — looks up MALE_NAMES + FEMALE_NAMES, falls back to a deterministic hash split so unknown names spread ~50/50. Sets now include cross-cultural names: Alejandro/ Andres/Mateo/Santiago/Joaquin/Cesar/Hugo/ Felipe/Gerardo/Salvador/Ramon (Hispanic), Raj/Anil/Vikram/Krishna/Pradeep (South Asian), Wei/Yi/Hiroshi/Akira/Hyun (East Asian), Demetrius/Kareem/DaQuan/Khalil (Black), Omar/Khalid/Hassan/Ahmed/Bilal (Middle Eastern). FEMALE_NAMES extended in parallel. guessEthnicityFromFirstName(name) — confident default of 'caucasian' for any name not in the cultural buckets so every worker resolves to a category the face pool can be biased toward. Order: ME → Black → Hispanic → South Asian → East Asian → Caucasian (matters where names overlap, e.g. Aisha appears in ME + Black, biases toward ME for visual fit). Both helpers also ported into console.html so the triage backfills and try-it-yourself rendering get the same hint stack. Privacy note in the script + route comments: the synthetic data uses the worker's name as the seed; production should hash worker_id (not name) to avoid leaking PII to a third-party CDN. The fetch URL itself is referenced once per pool build, not per-worker. .gitignore — added data/headshots/face_*.jpg (~100MB for 198 faces; the manifest + script are tracked). Re-running the script on a fresh checkout rebuilds the pool from scratch. Verified end-to-end via playwright on devop.live/lakehouse: forklift query → 10 worker cards 10/10 with face images (real synthetic headshots, not monograms) 0/10 broken Alejandro G. Nelson → ?g=man&e=hispanic Patricia K. Garcia → ?g=woman&e=caucasian Each name → unique face, deterministic across loads. Console triage backfills get the same treatment.	2026-04-28 06:01:04 -05:00
root	f92b55615f	demo: worker cards — sober monogram avatars + role bands (no cartoons) J: "It's two cartoonish right now the website looks like it was made by first grade teacher." Pulled the DiceBear personas-style headshots and the emoji role badges. They were generative-illustration playful; this is supposed to read like a staffing tool, not a kindergarten attendance sheet. Replacement design — restraint, signal, no glyphs: Avatar: 40px circle, monogram initials, muted dark background (#161b22), 1px ring (#21262d), white-ish text. No image, no emoji. Looks like a pre-photo placeholder slot in a real ATS. Role band: the role gets classified into one of five families: WAREHOUSE / PRODUCTION / SKILLED TRADE / DRIVER / LEAD (regex-based; falls back to the first word of the role for unknown families). Each family has a single muted color: blue / amber / purple / green / orange. The color appears as: - a 3px left border on the .iworker card - a 2px left border + matching text color on a small uppercase pill in the detail line That's it. No images, no emojis, no per-role illustrations. The staffer sees role-family at a glance via the band color, name and initials prominently, full role + city + zip in the detail string behind the pill. Five colors total instead of an eight-color rainbow. CSS: .iworker[data-role-band="warehouse"] etc. → 3px left border .role-pill[data-rb="warehouse"] etc. → matching pill border JS: ROLE_BANDS = 6 regex → band+label entries (warehouse, production, trades, driver, lead, quality) roleBand(role) = first matching entry, fallback to first word of role uppercased Verified end-to-end via playwright on devop.live/lakehouse: forklift query → 10 cards every card → monogram avatar + WAREHOUSE pill (blue band) no images, no emojis, no rainbow Restart sequence after these edits: pkill -9 -f "/home/profit/lakehouse/mcp-server/index.ts" ( setsid bun run /home/profit/lakehouse/mcp-server/index.ts \ > /tmp/mcp-server.log 2>&1 < /dev/null & disown )	2026-04-28 06:01:04 -05:00
root	db81fd8836	demo: System Activity panel — capability index reflects every recent shipment Old panel showed playbook ops + search counts and went empty in a fresh demo (no operations yet). J: "update System Activity to coincide with all of our recent updates." Rebuilt as a live capability index — each tile is a thing the substrate has learned to do, with the metric proving it's running. Pulled in parallel from /staffers, /system/summary, /api/vectors/playbook_memory/stats, /api/vectors/pathway/stats, /intelligence/profiler_index, /intelligence/activity. Each probe catches its own error so a single missing endpoint doesn't collapse the panel. Nine capability cards (verified end-to-end on devop.live/lakehouse): 1. Per-staffer hot-swap index 3 personas (Maria/Devon/Aisha) 2. Construction Activity Signal Engine 11 issuers · $347M attributed build value · network 11/14 3. Late-worker / no-show triage one-shot — name+late → backfills+SMS 4. Permit → staffing bridge 24/day, every Chicago permit ≥$250K 5. Hybrid SQL + vector search 500K workers · 5,474 playbook entries 6. Schema-agnostic ingestion 36 datasets · 2.98M rows 7. Contractor profile + project index 6 wired · 12 queued sources 8. Pathway memory 88 traces · 11/11 replays · 100% 9. Ticker association network 11 tickers · 3 direct + 11 associated Each card carries: - capability title + ship date pill ("baseline" or "shipped 2026-04-27") - big metric (live, not pre-baked) - sub-context line in coordinator language - "why a staffer cares" explanation - optional "Open →" deep link to the surface (Profiler, Contractor) Header + intro paragraph reframed: "what the substrate has learned to do" instead of "what the substrate has learned." Operational learning (fills, playbooks, hot-swaps) compounds INSIDE each capability; the panel surfaces the set of capabilities the corpus knows how to express. Closing operational-stats row at the bottom shows fills/searches/ recent playbooks when /intelligence/activity has any.	2026-04-28 06:01:04 -05:00
root	f6a7621b2d	demo: profiler index — directory of every Chicago contractor J asked for "a profiler index that shows a history of everyone." This is a /profiler directory page (also reachable via /contractors) that ranks every contractor who's filed a Chicago permit, by total permit value. Rows are clickable into the full /contractor profile. Defaults: since 2025-06-01, min permit cost $250K, top 200 contractors by total_cost. Server pulls two Socrata GROUP BY queries (one keyed on contact_1_name, one on contact_2_name), merges them so contractors listed in either applicant or contractor slot appear once with combined counts/cost. ~300ms cold. UI: live search box, since-date selector, min-cost selector, sortable columns (name / permits / total_cost / last_filed). Live numbers as of this write: 200 contractors, 1,702 permits, $14.22B aggregate. Filter "Target" returns TARGET CORPORATION + CORPORATION TARGET (name variants from Socrata). Also fixes J's other complaint — "no new contracts, Target is gone": /intelligence/permit_contracts was hard-capped at $limit=6 + only the most recent 6 over $250K, so any day with 6 fresh permits would push older contractors (Target) off the panel entirely. Now defaults to 24 (caller can pass body.limit up to 100), so 2-3 days of permits stay on the panel. Added body.contractor — passes a name into the WHERE so the staffer can pin a specific contractor to the panel ("Target Corporation" → 3 of their permits over $250K). Server-side: - new POST /intelligence/profiler_index — paginated contractor index (since, min_cost, search, limit) with merged contact_1+contact_2 aggregations - /intelligence/permit_contracts — body.limit + body.contractor - /profiler and /contractors routes serve profiler.html Front-end: - new mcp-server/profiler.html — sortable table, live filter, deep links to /contractor?name=... (prefix-aware via P, so /lakehouse works on devop.live) - search.html + console.html nav: added "Profiler" link Verified end-to-end via playwright on the public URL.	2026-04-28 06:01:04 -05:00
root	31d8ef918c	demo: contractor links — respect the /lakehouse path prefix J reported https://devop.live/contractor?name=3115%20W%20POLK%20ST.%20LLC returned 404. Cause: the anchor href was a bare /contractor, which on devop.live routes to the LLM Team UI (port 5000) at the main site root, not the lakehouse mcp-server (which lives under /lakehouse/*). Every page that renders a contractor link now uses the same prefix detector the dashboard already had: var P = location.pathname.indexOf('/lakehouse') >= 0 ? '/lakehouse' : ''; Files updated: - search.html: entity-brief anchor + preview anchor → P+/contractor - console.html: permit-card contractor list → P+/contractor - contractor.html: history.replaceState + back-link + the /intelligence/contractor_profile fetch all use P prefix. The page is reachable at /lakehouse/contractor on the public URL and bare /contractor on localhost; both work without further config. Verified: https://devop.live/lakehouse/contractor?name=3115%20W%20POLK%20ST.%20LLC → 200, 29.9 KB, full profile renders. Contractor has 1 permit on file (a small LLC), 1 geocoded so the heat map plots one marker.	2026-04-28 06:01:04 -05:00
root	5f0beffe80	demo: G — per-staffer hot-swap index (synthetic coordinator personas) Same corpus, different relevance gradient per staffer. Three personas defined in mcp-server/index.ts STAFFERS roster (Maria/IL, Devon/IN, Aisha/WI), each with a primary state + secondary cities. Server-side: /intelligence/chat smart_search accepts a staffer_id body field; when set, defaults state to the staffer's territory and labels the playbook context as theirs. The playbook patterns query also defaults its geo to the staffer's primary city/state, so the recurring-skills/cert breakdowns reflect what they actually fill, not the global IL prior. Front-end: a staffer selector dropdown beside the existing state/role filters. Picking a staffer auto-pins state to their territory, shows a greeting line, relabels the MEMORY panel as MARIA'S/DEVON'S/AISHA'S MEMORY, and sends staffer_id to chat for scoping. Dropdown is populated from /staffers (NOT /api/staffers — the generic /api/* passthrough sends everything under /api/ to the Rust gateway, which doesn't own the roster). loadStaffers runs at window-load independently of loadDay's Promise.all so the dropdown populates even if simulation/SQL inits error out. Verified end-to-end via playwright. Same q="forklift operators": no staffer → 509 workers across MI/OH/IA, MEMORY label as Devon → 89 IN-only (Fort Wayne, Terre Haute), DEVON'S MEMORY as Aisha → 16 WI-only (Milwaukee, Madison, Green Bay), AISHA'S MEMORY As Maria with q="8 production workers near 60607": tags: headcount: 8 · zip 60607 → Chicago, IL · role: production · city: Chicago 20 workers, MARIA'S MEMORY label, top results in Chicago zips Closes the demo-side build of A-G from the persona plan: A. zip → city/state, B. headcount, C. bare-name, D. temporal, E. late-worker triage, F. contractor anchor, G. per-staffer index.	2026-04-28 06:01:04 -05:00
root	677065de76	demo: P2 — staffer-language routes (zip, headcount, name, late-triage, ingest log) Built from a playwright run as three personas: Maria — "8 production workers near 60607 by next Friday, prior-fill at this client" Devon — "what came in last night?" Aisha — "Marcus running late site 4422" Each one previously fell through to smart_search and returned irrelevant results (geo wrong, headcount ignored, no triage, no temporal). Now: A. Zip code → city/state lookup. Chicago zips (606xx, 607xx, 608xx) resolve to {city: Chicago, state: IL}; 13 metro prefixes covered. Maria's "near 60607" now returns Chicago workers, not Dayton/Green Bay. B. Headcount parser. "8 production workers" / "12 forklift operators" / "5 welders" set top_k 1..200, capped 5..25 for SQL+vector LIMIT. Allows 0-2 role words between the count and the worker noun so "8 production workers" matches as well as "8 workers". C. Bare-name profile lookup. Single short capitalized phrase ("Marcus" / "Sarah Lopez") triggers a profile route. Per-token LIKE AND-joined so "Marcus Rivera" matches "Marcus L. Rivera" without hardcoding middle initials. E. Late-worker / no-show triage. Pattern: <Name> (running late\|late\| no show\|sick\|out today\|called out\|can't make it) — pulls profile + reliability + responsiveness + recent calls, sources 5 same-role same-geo backfills sorted by responsiveness, drafts a client SMS the coordinator can copy. Front-end renders triage card + Copy SMS button + green backfill list. F. Contractor name preview anchor. The PROJECT INDEX preview line on each permit card now wraps contact_1_name and contact_2_name in anchors to /contractor?name=... — clicking a contractor finally navigates instead of doing nothing. Click handler stops propagation so the details element doesn't toggle. D. Temporal "what came in" route. last night / today / past N hours / recent — surfaces datasets from the catalog whose updated_at is within the window, samples one row per dataset to detect worker- shape, groups by role for worker tables. Schema-agnostic — drop any dataset and it shows up. Currently sparse because no fresh ingest has happened today; will populate as ingest runs. Server: /intelligence/chat smart_search route accepts structured state/role from the search-form dropdowns (P1 from prior commit) and now ALSO honors b.state, b.role, q.match for headcount + zip + name + triage patterns BEFORE falling through to NL parsing. Front-end: doSearch dispatches on response.type and renders triage, profile, ingest_log, and miss states with type-specific UI. All DOM construction uses textContent / appendChild — no innerHTML, no XSS. Verified end-to-end via playwright drive of devop.live/lakehouse: Maria → 8 Chicago Production Workers (60685, 60662, 60634) tags: "headcount: 8 · zip 60607 → Chicago, IL · ..." Aisha → Marcus V. Campbell card + draft SMS + 5 Quincy IL backfills "I'm dispatching Scott B. Cooper (96% reliability) to cover." Devon → ingest_log surfaces successful_playbooks_live (last 1h) Marcus → 5 profiles (Adams Louisville KY, Jenkins Green Bay WI, ...) Screenshots: /tmp/persona_v2/{01_maria,02_aisha,03_devon,04_marcus}.png Restart sequence after these edits: pkill -9 -f "mcp-server/index.ts" ; cd /home/profit/lakehouse ; bun run mcp-server/index.ts. The bun on :3700 is not systemd-managed (pre-existing convention).	2026-04-28 06:01:04 -05:00
root	fb99e92a60	demo: P1 — search filter now actually filters by state and role The Co-Pilot search box read state and role from the dropdowns (#sst, #srl) but appended them to the message string as ' in '+st. The server's NL parser then matched the literal preposition "in" against the case-insensitive regex /\b(IL\|IN\|...)\b/i and assigned state IN (Indiana) to every search. Result: typing "forklift in IL" returned Indiana workers. Same for WI, TX, any state — all silently became Indiana. That was the "cached/generic response" the legacy staffing client was seeing. Two prongs: 1. search.html doSearch() now passes structured fields: {message, state, role} instead of munging into the message text. Dropdown selections bypass NL parsing entirely. 2. /intelligence/chat smart_search route accepts those structured fields and prefers them over regex archaeology. Falls back to NL parsing only when fields aren't provided. Fixed the regex too: the prepositional form (?:in\|from)\s+(STATE) wins, the standalone form requires uppercase (drops /i flag) so the lowercase preposition "in" can no longer match. Verified live: - POST /intelligence/chat {"message":"forklift","state":"IL"} → 167 IL forklift operators (Galesburg, Joliet, ...) - POST /intelligence/chat {"message":"forklift","state":"WI","role":"Forklift Operator"} → 16 WI Forklift Operators (Milwaukee, Madison, ...) - POST /intelligence/chat {"message":"forklift in IL"} (NL fallback) → 167 IL workers (regex now correctly distinguishes preposition from state code) Playwright drove the live UI through devop.live/lakehouse and confirmed the front-end posts the structured body and the result panel renders the right state. Restart sequence: kill old bun :3700, bun run mcp-server/index.ts.	2026-04-28 06:01:04 -05:00
root	b843a23433	mcp: contractor entity-brief drill-down + mobile UX pass Adds /contractor page route plus /intelligence/contractor_profile endpoint that fans out across OSHA, ticker, history, parent_link, federal contracts, debarment, NLRB, ILSOS, news, diversity certs, BLS macro — single per-contractor portfolio view across every wired source. search.html: mobile responsive layout, fixed bottom dock with horizontal scroll-snap, legacy bridge row stacking, viewport overflow guards. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-25 17:07:23 -05:00
root	858954975b	Staffing Co-Pilot UI — architecture-first enrichments + shift clock Some checks failed lakehouse/auditor 2 blocking issues: todo!() macro call in tests/real-world/scrum_master_pipeline.ts J's direction: the dashboard was explanatory but not actionable as a staffing-matrix console. Refactor so the architecture claims from docs/PRD.md surface as operational signals on every contract card. Backend (mcp-server/index.ts): + GET\|POST /intelligence/arch_signals — probes live substrate health so the dashboard shows instant-search latency, index shape, playbook-memory entries, and pathway-memory (ADR-021) trace count. Fires one fresh /vectors/hybrid probe against workers_500k_v1 so the "instant search" number on screen is live, not cached. * /intelligence/permit_contracts now times every hybrid call per contract and returns search_latency_ms, so the card can display the per-query latency pill (⚡ 342ms). + Per-contract computed fields returned from the backend: search_latency_ms — real /vectors/hybrid duration fill_probability — base_pct (by pool_size×count ratio) + curve [d0, d3, d7, d14, d21, d30] with cumulative fill% per bucket economics — avg_pay_rate, gross_revenue, gross_margin, margin_pct, payout_window_days [30, 45], over_bill_count, over_bill_pool_margin_at_risk shifts_needed — 1st/2nd/3rd/4th inferred from permit work_type + description regex * Pre-existing dangling-brace bug in api() fixed (the `activeTrace` logging block had been misplaced at module scope, referencing variables that only existed inside the function). Restart was failing with "Unexpected }" at line 76. Moved tracing inside the try block where parsed/path/body/ms are in scope. Frontend (mcp-server/search.html): + Top "Substrate Signals" section — 4 live tiles (instant search, index, playbook memory, pathway matrix). Color-codes latency (green <100ms, amber <500ms, red otherwise). + "24/7 Shift Coverage" section — SVG 24-hour clock with 4 colored shift arcs (1st/2nd/3rd/4th), current-time needle, center label showing the live shift, per-shift contract count tiles beside. 4th shift assumes weekend/split; handles 3rd-shift wrap across midnight by splitting into two arcs. + Per-card architecture pills: instant-search latency, SQL-filter pool-size with k=200 boost note, shift requirements. + Per-card fill-probability horizontal stacked bar with day markers (d0/d3/d7/d14/d21/d30) and per-bucket segment shading (green → amber → orange → red as time decays). + Per-card economics 4-tile grid: Est. Revenue, Est. Margin (with % colored by health), Payout Window (30–45d standard), Over-Bill Pool count + margin at risk. Architecture smoke test (tests/architecture_smoke.ts, earlier commit) still green: 11/11 pass including the new /intelligence/arch_signals + permit_contracts enrichments. J specifically wanted: "shoot for the stars · hyperfocus · our architecture is better because it self-regulates, uses hot-swap, pulls from real data, and shows instant searches from clever indexing." Every one of those is now a specific visible signal on the page, not prose in the README. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 14:24:11 -05:00
root	0ff091c173	Honesty fixes — no hard-coded counts, dynamic sample CSV - generateSampleRosterCSV(): 120-180 randomized rows per call, timestamp-prefixed IDs (no dedup on re-upload, no static 25 row lie) - /system/summary: truth via SQL COUNT(*), surfaces manifest_drift (caught candidates: manifest 100K, actual 1K) - search.html: loadSystemSummary() hydrates live counts; removed hard-coded 500K strings - MCP tool description: "candidates (100K)" → "candidates (1K)", added "workers_500k (500K)"	2026-04-20 19:07:47 -05:00
root	af3856b103	Rate/margin awareness: implied pay rate per worker, bill rate per contract Closes one of the Path 1 trust-break gaps. The scenario we kept flagging: recruiter calls the system's top pick, worker quotes $35/hr, contract pays $28/hr. First broken call kills the demo. This fixes it. Heuristic (no schema change, derived at query time): - Per worker: implied_pay_rate = role_base + (reliability × 4) + archetype_bump role_base: Electrician $28, Welder $26, Machine Op $24, Maint $26, Forklift Op $20, Loader $17, Warehouse Assoc $17, Quality Tech $23, Production Worker $18 ... archetype bump: specialist +4, leader +3, reliable +1, else 0 - Per contract: implied_bill_rate = role_base × 1.4 (40% markup — industry norm: pay + overhead + insurance + margin) - Worker is 'over_bill_rate' when implied_pay_rate > contract's bill_rate on a candidate-by-candidate basis Backend (mcp-server/index.ts): - ROLE_BASE_PAY_RATE + BILL_MARKUP constants - impliedPayRate(worker), impliedBillRate(role) functions - parseWorkerChunk() extracts role/reliability/archetype from vector text - enrichWithRates() attaches implied_pay_rate on every /vectors/hybrid source response. Called from /search and /intelligence/permit_contracts. - /search accepts optional max_pay_rate number — if set, filters out workers above that rate and reports pay_rate_filtered_out count. - /intelligence/permit_contracts returns implied_bill_rate per contract AND over_bill_rate boolean per candidate. Frontend (search.html): - Live Contracts cards show 'bill rate: $X/hr' under the headcount line - Each candidate shows 'pay $X/hr' in the sub-line; red 'Over bill rate' chip next to name when their pay exceeds the contract's bill rate (hover reveals the exact numbers and why it's flagged) - Main 'Search all workers' results now include 'pay $X/hr' in the why-text (computeImpliedPayRate mirrored client-side to match Bun) End-to-end verified live: - Masonry Work permit, bill_rate $25.20/hr Kathleen M. Gutierrez pay $25.56/hr → 🔴 OVER Melissa C. Rivera pay $20.88/hr → 🟢 OK - /search with max_pay_rate:32 filtered out 1 Toledo Welder above $32 - Main search shows 'pay $28.64/hr' in each result row When real ATS data replaces synthetic workers_500k, same UI — the client's real pay_rate column substitutes for the heuristic.	2026-04-20 18:56:51 -05:00
root	a117ae8b38	Workspace UI — surface Phase 8.5 per-contract state + handoff Phase 8.5 was fully built on the Rust side (WorkspaceManager with create/handoff/search/shortlist/activity/get/list, persisted to object storage, zero-copy handoff between agents). Nothing surfaced it in the recruiter UI. This page closes that gap. /workspaces — split-pane UI: Left: scrollable list of all workspaces, sorted by updated_at. Each card shows name, tier pill (daily/weekly/monthly/pinned), current owner, count of shortlisted candidates + activity events. Right: selected workspace detail with five sections: 1. Header — name, tier, owner, created/updated dates, description, previous-owners audit trail (each handoff is preserved) 2. Actions row — Hand off, Shortlist candidate, Save search, Log activity 3. Shortlist — candidates flagged with dataset + record_id + notes 4. Saved searches — named SQL queries the staffer wants to rerun 5. Activity — chronological (newest first) log of what happened Four modals for the add/edit actions (create, handoff, shortlist, save-search, log-activity). All forms POST through the existing /api/* passthrough to the gateway's /workspaces/* routes. End-to-end verified live: 1. Sarah creates 'Demo: Toledo Week 17' workspace 2. Shortlists Helen Sanchez (W500K-4661) with notes about prior endorsements 3. Logs activity: 'called — Helen confirmed Tuesday 7am shift' 4. Hands off to Kim with reason 'end of shift' 5. Kim opens the workspace: owner=kim, previous_owners=[{sarah→kim}], sees all 3 prior events + the shortlisted Helen — no data copy, pointer swap only (Phase 8.5 design) Security: all dynamic content built via el(tag,cls,text) DOM helper. Zero innerHTML on API-derived strings. Modal close-on-backdrop-click is guarded to the backdrop element. Nav updated across all 7 pages. Workspaces is the 7th tab. Dashboard · Walkthrough · Architecture · Spec · Onboard · Alerts · Workspaces.	2026-04-20 18:36:51 -05:00
root	6287558493	Push/daemon presence: background digest + /alerts settings page Converts the app from 'dashboard you visit' to 'system that finds you.' Critical for the phone-first staffing shop that won't open a URL — the system reaches out when something matters. Daemon: - Starts once per Bun process (guarded via globalThis sentinel) - Default interval 15 min (configurable, min 1, max 1440) - On each cycle, buildDigest() compares current state against prior snapshot persisted in mcp-server/data/notification_state.json - Events detected: - risk_escalation: role moved to tight or critical (was ok/watch) - deadline_approaching: staffing window falls within warn window (default 7 days) AND deadline date differs from prior - memory_growth: playbook_memory entries grew by >= 5 since last run Channels (all opt-out individually via config): - console: always on, logged to journalctl -u lakehouse-agent - file: always on, appends JSONL to mcp-server/data/notifications.jsonl - webhook: optional, POSTs {text, digest} to configured URL (Slack incoming-webhook / Discord webhook / any custom endpoint) Digest format (human-readable, fits in a Slack message): LAKEHOUSE DIGEST — 2026-04-20 23:24 3 staffing deadlines within window: • Production Worker — 2d to 2026-04-23 · demand 724 • Maintenance Tech — 4d to 2026-04-25 · demand 32 • Electrician — 5d to 2026-04-26 · demand 34 +779 new playbooks (total 779, 2204 endorsed names) snapshot: 0 critical · 0 tight · $275,599,326 pipeline /alerts page: - Current status table (daemon state, interval, webhook, last run) - Config form: enable toggle, interval, deadline warn window, webhook URL + label (saved to data/notification_config.json) - 'Fire a test digest now' button — force a cycle without waiting - Recent digests panel shows the last 10 dispatches with full text End-to-end verified live: - Daemon armed successfully on startup - First-run digest dispatched to console + file in <1s - Events detected correctly: 3 deadlines within 7 days from real Chicago permit data; 779 playbook entries surfaced as memory growth - Digest text format is Slack-pastable - Dispatch records appear in /alerts recent list TDZ caveat: startAlertsDaemon() invocation moved to end of module so all const/let in the alerts block evaluate before daemon reads them. Previously failed with 'Cannot access X before initialization' when the call lived near the top of the file. Nav added to all 6 pages: Dashboard · Walkthrough · Architecture · Spec · Onboard · Alerts.	2026-04-20 18:24:48 -05:00
root	23eb04a145	Onboarding wizard — ingest any staffing CSV in 3 steps New /onboard page. Client-facing wizard for getting real data into the system without engineering help. Flow: 1. Drop a CSV (or click 'Use the sample as my data' — ships a 25-row realistic staffing roster under /samples/staffing_roster_sample.csv) 2. Browser parses client-side. Columns auto-typed (text/int/decimal/ date). PII flagged by name hint AND content regex (emails, phones). First rows previewed. Read-only — nothing written yet. 3. Name the dataset (lowercase+underscores). Commit. 4. Post-commit: dataset is live. Shows 4 next steps the operator can take (SQL query, vector index, dashboard search, playbook training). Backend: - /onboard serves onboard.html - /samples/.csv serves CSV files from mcp-server/samples/ with filename validation (only [a-zA-Z0-9_-.]+.csv, prevents path traversal) - /onboard/ingest forwards multipart/form-data to gateway /ingest/file preserving the boundary. The generic /api/ passthrough breaks multipart because it reads as text and forwards as JSON; this route uses arrayBuffer + original Content-Type. Verified end-to-end: upload sample roster (25 rows, 12 columns) → parse in browser → show columns + PII flags + preview → commit → gateway writes Parquet, registers in catalog → immediately queryable: SELECT * FROM onboard_demo2 LIMIT 3 → Sarah Johnson, Forklift Operator, Chicago, IL, 0.92 Round-trip <1 second. Nav updated on all pages to link Onboard. Shipped with a sample CSV so the full flow is demonstrable without real client data. When a real client shows up, same path — they upload their CSV. No engineering ticket, no code change, no schema pre-definition. Security: sample filename regex prevents path traversal. CSV parse is client-side pure JS (no DOM injection). Commit uses existing /ingest/file validation (schema fingerprint, PII server-side, content-hash dedup).	2026-04-20 18:13:56 -05:00
root	468798c9ac	/spec: technical specification — 11-chapter README-equivalent J's ask: explain the full architecture so someone reading a README can dispute it or recreate it. The repo isn't public yet; this page IS the spec until it is. Ch1 Repository layout — 13 crates + tests/multi-agent + docs + data, with owned responsibility and file path per crate. Ch2 Data ingest pipeline (8 steps) — sources (file/inbox/DB/cron), parse+normalize with ADR-010 conservative typing, PII auto-tag, dedup, Parquet write, catalog register with fingerprint gate, mark embeddings stale, queryable immediately. Ch3 Measurement & indexing — row count / fingerprint / owner / sensitivity / freshness / lineage per dataset. HNSW vs Lance tradeoff table with measured numbers (ADR-019). Autotune loop. Per-profile scoping (Phase 17). Ch4 Contract inference from external signal — Chicago permit feed → role mapping → worker count heuristic → timeline → hybrid search with boost → pattern discovery → rendered card. All pre-computed before staffer opens UI. Ch5 What a CRM can't do — 11-row comparison table of capabilities. Ch6 How it gets better over time — three paths: - Phase 19 playbook boost (full math) - Pattern discovery meta-index - Autotune agent Ch7 Scale story: 20 staffers, 300 contracts, midday +20/+1M surge - Async gateway + per-staffer profile isolation + client blacklists - 7-step surge handling flow (ingest, stale-mark, incremental refresh, degradation, hot-swap, autotune re-enter) - Known pain points: Ollama inference serial, RAM ceiling ~5M on HNSW (mitigated by Lance), VRAM 1-2 models sequential, playbook_memory unbounded. Ch8 Error surfaces & recovery — 10-row table covering ingest schema conflicts, bucket failures, ghost names, dual-agent drift, empty searches, Ollama down, gateway restart, schema fingerprint divergence. Every failure has a named surface and recovery path. Ch9 Per-staffer context — active profile, workspace, client blacklist, audit trail, daily summary. How 20 staffers don't see the same UI. Ch10 Day in the life — 07:00 housekeeping → 07:30 refresh → 08:00 staffer opens → 08:15 drill down → 08:30 Call click → 09:00 second staffer shares memory → 12:30 surge → 14:00 no-show → 15:00 new embeddings live → 17:00 retrospective → 22:00 overnight trials. Ch11 Known limits & non-goals — deferred (rate/margin, push, confidence calibration, neural re-ranker, pm compaction, call_log cross-ref) and explicitly out-of-scope (cloud, ACID, streaming, CRM replace, proprietary formats, hard multi-tenant). Also: nav updated on /dashboard, /console, /proof to link /spec. Every architectural claim in the spec cites either a code path, an ADR number, or a phase reference so someone skeptical can target the specific artifact.	2026-04-20 17:56:18 -05:00
root	bb1b471c67	Predictive staffing forecast + per-contract timeline J's ask: move the system from retrospective ranking to predictive anticipation. Show it tracks the clock, not just the roster. New endpoint /intelligence/staffing_forecast: - Pulls 30-day Chicago permit window (200 permits) - Maps work_type → role via industry heuristic - Aggregates predicted worker demand per role - Joins IL bench supply (workers_500k state='IL' group by role) - Computes coverage_pct, reliable_coverage_pct - Classifies risk: critical/tight/watch/ok - Computes earliest staffing deadline per role (permit issue_date + 31d = 45d construction start - 14d window) - Surfaces recent Chicago playbook ops for the role-specific memory New UI 'Staffing Forecast' section ABOVE Live Contracts: - Top card: total construction value, permit count, workers needed, critical/tight role count - Per-role rows: demand vs available supply, coverage %, deadline with red/amber/green urgency coloring Per-contract timeline on Live Contracts: - estimated_construction_start, staffing_window_opens, days_to_deadline - urgency classification: overdue/urgent/soon/scheduled - card border colored by urgency - timeline line explicitly shows recruiter: OVERDUE/URGENT + days count This is the 'system already thinks about when, not just who' surface J was asking for. CRMs store; this anticipates.	2026-04-20 17:24:17 -05:00
root	2595d48535	Gap fixes: pattern fallback, narrative citations, call_log plumbing Closing trust-breaks surfaced in the strategic audit. A — MEMORY chip renders even when sparse: Previously rendered nothing when no trait crossed threshold, which recruiters would read as "system has no signal." Now explicitly says "memory is sparse for this role+geo — no trait crossed threshold" or "no similar past playbooks yet — first fill of this kind will seed it." Honest when it doesn't know. B — Removed /intelligence/learn dead endpoint: Legacy CSV-writer path that destructively re-wrote successful_playbooks. /log and /log_failure replace it cleanly. Leaving dead code confuses future maintainers. C — Narrative tooltips on Endorsed chips: Hovering the green "Endorsed · N playbooks" chip now fetches the worker's past operations from successful_playbooks_live and shows a story: "Maria — past endorsements: • Welder x2 in Toledo (2026-04-15), • Welder x1 in Toledo (2026-04-18)..." Falls back to honest "narrative unavailable" if the seed didn't land in SQL. D — call_log infrastructure in worker modal: New "Recent Contact" section queries call_log JOIN candidates by name. Surfaces last 3 call entries with timestamp, recruiter, disposition, duration. When empty (which is today's reality — candidates table only has 1000 rows vs call_log's higher IDs), shows an honest message about the data gap and what real ATS integration would unlock. Honest call: D ships infrastructure. Actual utility depends on aligning candidate IDs between the candidates table and call_log — current synthetic data doesn't cross-ref cleanly. When real ATS data lands, this section becomes the "system knows who we called yesterday" feature the recruiter needs. Deferred (would require a dedicated session): - Rate awareness (needs worker pay_rate + contract bill_rate) - Push / background daemon (Slack/SMS/email integration) - Confidence calibration (needs a probabilistic ranking layer)	2026-04-20 17:20:22 -05:00
root	1f56630d5d	#3 : Worker profile modal shows past playbook history Click any worker card → modal now includes a 'Past Playbooks' section that queries successful_playbooks_live for any row where this worker's name appears in the result field. Shows up to 8 most recent with operation, timestamp, approach, and context. When empty: 'No prior playbooks for NAME yet. First placement builds the first entry.' — makes the institutional-memory claim visible to the recruiter: the system is tracking everyone, not just the ones that sealed this session. Also added Call / SMS / No-show buttons to the modal action row (matching the card-level buttons from #1). Every worker-card path now trains the system. Closes the user-visible side of Phase 19 — patterns surface during search (Pass A), boosts fire in ranking (Phase 19 core), and now the worker's own profile shows the full history that informs those boosts. Institutional memory legibility, per J's ask.	2026-04-20 16:21:27 -05:00
root	4aea71d213	#1 : Close recruiter feedback loop — Call/SMS/No-show fire /log and /log_failure Every worker-card button in the dashboard now trains the Phase 19 system directly: - Call → POST /log (seeds playbook_memory + persists SQL) - SMS → POST /log (same — both count as positive engagement) - No-show → POST /log_failure (per-worker penalty 0.5^n on future boost) Buttons flash status (Logged / Flagged / Ghost) for 1.4s on success, then re-enable. Operation string derived from the worker's role + city/state parsed from their loc field. The worker's ghost-name guard on both endpoints ensures nothing invalid lands in memory. Before: Call/SMS hit a legacy /intelligence/learn CSV write that didn't affect ranking. No failure capture existed. Now: recruiter using the app IS the training signal. Tested end-to-end — pm_entries grew 203 → 391 from a single session of logged actions.	2026-04-20 16:19:14 -05:00
root	99ab0fe623	A+B: patterns in main search + compounding bump A — Patterns surface in main Worker Search: /intelligence/chat smart_search fallback now calls /patterns in parallel with hybrid, returns discovered_pattern + matched count. search.html doSearch renders a green "MEMORY (N playbooks): ..." chip above results so every recruiter query shows the meta-index dimension, not just live-contract cards. B — Compounding proven and default-k bumped: Direct compounding test on Chicago Electrician: - Run 0 (no seeds): Carmen Green not in top-5, boost 0 - After 3 seeds of identical operation: boost +0.250 (capped), 3 citations, lifted to #1. Each seed adds 1 citation. Cap prevents one worker from dominating future searches. - Required k=200 (not 25 or 50) — embedding band is narrow (cosines 0.55-0.67 across all playbooks regardless of geo). - Bumped defaults on /search, permit_contracts, and smart_search to playbook_memory_k=200. Brute-force sub-ms at this scale.	2026-04-20 15:41:12 -05:00
root	5c39c74fe4	Live Contracts canvas: Chicago permits × workers_500k × playbook patterns New devop.live/lakehouse section pairs live public Chicago building permits with derived staffing contracts, ranked candidates from the 500K worker bench, and meta-index discovered patterns per role+geo. Makes the Phase 19 boost + Path 2 pattern discovery visible on real external data, without needing a paying client to demo. Backend: - New /intelligence/permit_contracts endpoint - Fetches 6 recent Chicago permits > $250K from the Socrata API - Derives proposed fill: 1 worker per $150K of permit value (capped 2-8) - For each: /vectors/hybrid with use_playbook_memory=true, playbook_memory_k=25, auto availability>0.5 filter - For each: /vectors/playbook_memory/patterns with k=25 min_freq=0.3 - Returns permit + proposed contract + top 5 candidates with boosts and citations + discovered pattern + pattern_matched count Frontend: - New "Live Contracts" section on search.html between today's sim contracts and Market Intelligence - Per-permit card: cost + work_type + address + proposed role/count + pool size + top 3 candidates (with endorsement chip when boost fires) + memory-derived pattern ("MEMORY (N playbooks): recurring certifications: OSHA-10 47%, Forklift... · archetype mostly: ...") Real working demo even without paying clients: shows the system operating on genuinely external data with our synthetic-data-derived learning applied.	2026-04-20 15:36:14 -05:00
root	25b7e6c3a7	Phase 19 wiring + Path 1/2 work + chain integrity fixes Backend: - crates/vectord/src/playbook_memory.rs (new): Phase 19 in-memory boost store with seed/rebuild/snapshot, plus temporal decay (e^-age/30 per playbook), persist_to_sql endpoint backing successful_playbooks_live, and discover_patterns endpoint for meta-index pattern aggregation (recurring certs/skills/archetype/reliability across similar past fills). - DEFAULT_TOP_K_PLAYBOOKS bumped 5 → 25; old default silently missed most boosts when memory had > 25 entries. - service.rs: new routes /vectors/playbook_memory/{seed,rebuild,stats, persist_sql,patterns}. Bun staffing co-pilot (mcp-server/): - /search, /match, /verify, /proof, /simulation/run, MCP tools all forward use_playbook_memory:true and playbook_memory_k:25 to the hybrid endpoint. Boost was previously dark across the entire app. - /log no longer POSTs to /ingest/file — that endpoint REPLACES the dataset's object list, so single-row CSV writes were wiping all prior rows in successful_playbooks (sp_rows went 33→1 in one /log call). /log now seeds playbook_memory with canonical short text and calls /persist_sql to keep successful_playbooks_live in sync. - /simulation/run cumulative end-of-week CSV write removed for the same reason. Per-day per-contract /seed (added in this session) is the accumulating feedback path now. - search.html addWorkerInsight renders a green "Endorsed · N playbooks" chip with playbook citations when boost > 0. Internal Dioxus UI (crates/ui/): - Dashboard phase list rewritten through Phase 19 (was stuck at "Phase 16: File Watcher" / "Phase 17: DB Connector" — both wrong). - Removed fabricated "27ms" stat label. - Ask tab examples + SQL default replaced with real staffing prompts against candidates/clients/job_orders (was referencing nonexistent employees/products/events). - New Playbook tab exposes /vectors/playbook_memory/{stats,rebuild} and side-by-side hybrid search (boost OFF vs ON) with citations. Tests (tests/multi-agent/): - run_e2e_rated.ts: parallel two-agent (mistral + qwen2.5) build phase + verifier rating (geo, auth, persist, boost, speed → /10). - network_proving.ts: continuous build → verify → repeat with staffing-recruiter profile hot-swap; geo-discrimination check. - chain_of_custody.ts: single recruiter operation traced through every layer (Bun /search, direct /vectors/hybrid parity, /log, SQL, playbook_memory growth, profile activation, post-op boost lift).	2026-04-20 06:21:13 -05:00
root	8e3cac5812	Polish: professional layout, collapsible sections, tighter design - Replaced amateur CSS with professional dark theme (Inter font, muted palette, proper spacing, consistent border radius, hover states, transitions) - Nav bar with Dashboard/Intelligence Console/Architecture tabs - Urgent pipeline: shows contracts directly, removed busy step indicators - In Progress + Ready to Go: collapsed by default with expand toggle (page went from 30+ visible contract cards to just the urgents) - Workers Available: limited to 5 instead of 8 - Proper section headers with labels and metadata - Search section always visible with better placeholder text - Professional footer with product branding - Responsive breakpoints for mobile (768px, 480px) - Page is now ~50% shorter with same information density Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-17 20:29:45 -05:00
root	2da8562c90	Interactive permit heat map with live data verification - Leaflet.js map with dark tiles showing real Chicago building permits - Dots sized and colored by project cost ($1B+ red, $100M+ orange, $10M+ blue) - Hover any dot for project details — address, cost, description, date - LIVE indicator with green pulse dot - Timestamp showing when data was fetched - "Verify source" link goes directly to Chicago Open Data portal - "Refresh" button re-fetches from the API on click - Expanded to 50 permits for denser map coverage - Legend showing dot size scale No one can say "you just typed those numbers in" when they can click a dot on the map, see 10000 W OHARE ST, and verify it themselves on data.cityofchicago.org. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-17 20:24:43 -05:00
root	9acbe5c369	Market Intelligence: live Chicago building permits → staffing demand forecast /intelligence/market pulls real permit data from Chicago Open Data API: - $9.6B in active construction permits - O'Hare expansion ($730M), new casino ($580M), transit station ($445M) - Maps permit types to staffing roles (electrical→Electrician, masonry→Loader) - Cross-references with our IL worker bench to show coverage gaps - Electrician gap: only 1,036 reliable vs 63K estimated demand Datalake page now shows three intelligence layers: 1. Contract simulation with scenario-driven matching 2. Market Intelligence with live permit data + bench analysis 3. System Learning with fill history and detected patterns The staffing company sees demand forming before the phone rings. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-17 20:12:01 -05:00
root	b16e485be1	Every page refresh feeds the learning loop — contracts logged as playbook entries Each simulation fill now logs: role, headcount, city, state, workers matched, client, start time, and scenario type. One page refresh = ~20 playbook entries. 4 refreshes = 28 entries with patterns already forming. Fixed activity counters: shows Contract Fills, Searches, and Patterns. Activity feed now shows the actual fill data with worker names and scenarios. This is the PRD's learning loop in action — the system records every successful match so future queries can learn from past decisions. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-17 20:05:51 -05:00
root	bba5b826a3	Learning loop + smart search on datalake page Learning Loop: - /intelligence/learn endpoint logs search→selection as playbook entry - /intelligence/activity returns learning stats, patterns, and recent activity - Call/SMS buttons trigger logSelection() — records what query led to what pick - "System Learning" card on main page shows searches logged, patterns detected, and recent activity feed with timestamps - Every search-selection pair becomes institutional knowledge stored in the lakehouse Smart Search on Main Page: - doSearch() now routes through /intelligence/chat (smart NL parser) - Extracts role, city, state, availability, reliability from natural language - Shows understanding tags so staffer sees what the system parsed - Returns workers with ZIP codes, availability %, reliability %, archetype - "reliable forklift operator available in Nashville" → 10 Nashville forklift operators with ZIP codes, all 86-98% reliable, all available — 372ms Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-17 19:59:07 -05:00
root	37c68d9567	Kill all static/fake elements — every number on the page is now live from data Skeptic-proof audit: - Worker count queried from database (was hardcoded "500K") - State/role dropdowns populated from actual data (was hardcoded 8 states, 6 roles) - Now shows 11 states, 21 roles — whatever exists in the dataset - Client names generated combinatorially (20×20=400 combos, was 12 static) - Top workers randomized with SQL OFFSET (was same 5 every time) - Deleted fabricated "Recent Activity" section (fake placement history) - Replaced with transparent "Data Source" showing where numbers come from - Fixed NOTES undefined crash — hybrid search actually returns results now (was silently failing, showing 0/X filled on every contract) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-17 17:09:22 -05:00
root	e9b5498f43	Contextual insights: workers and bench strength driven by today's actual contracts - loadDay() now runs simulation first, extracts unfilled roles/states, then builds SQL queries filtered to what's actually needed today - "Workers Available for Today's Open Contracts" replaces generic top-5 list - Each worker shows which gap they fill: "Could fill 4 open Loader spots" - Bench Strength section scoped to states with active contracts + open slot counts - Every refresh produces different workers because contracts change each time Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-17 17:04:39 -05:00
root	be7436b6f0	Diverse scenario engine: 15 weighted staffing situations replace crisis-every-refresh Simulation now uses weighted random selection across 4 priority tiers: - Urgent (walkoff, quarantine, no-show), High (new client, cert expiry, expansion), Medium (recurring, seasonal, medical leave, cross-train), Low (future, exploratory) - Color-coded scenario banners on ALL contracts, not just urgent - Each scenario carries context (note) + recommended action Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-17 16:41:00 -05:00
root	e87155306b	Urgent explains WHY and WHAT TO DO — not just a red dot Urgent contracts now show: - Red banner with specific reason: 'Client called last night', 'Emergency coverage — 2 no-shows reported', 'Production surge', 'Original crew cancelled', etc. - Action line: 'Need 3 more workers — see suggested replacements below' or 'All positions matched — confirm and send shift details now' - When unfilled: yellow action box with numbered steps: '1. Call the workers above, 2. If someone declines the backup is ready, 3. Expand search to nearby states' - FIRST CHOICE worker highlighted with red border - BACKUP workers labeled and shown after the required headcount The staffer doesn't see a red circle and wonder. They see: 'Emergency coverage — 2 no-shows. Need 3 more. Here are your options. Call this person first. If they can't, here's the backup.' Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-17 16:32:50 -05:00
root	2155959013	Worker profile modal: click any worker to see full details Click any worker avatar/card → scrollable modal with: - Rich profiles: reliability/availability bars with explanations, skill tags, cert badges, archetype with description, work history, Call/SMS action buttons - Sparse profiles: trust path showing 'You are here' → progression to full profile through normal operations - Modal scrolls independently, background locked - Close via X button or click outside Each archetype has a plain-English description: reliable: 'Consistently shows up, clients request them back' leader: 'Takes initiative, helps train others' erratic: 'Inconsistent attendance, needs monitoring' etc. Work history shows recent placements and cert renewals. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-17 16:25:42 -05:00
root	45a95a9feb	Urgent pipeline: step-by-step workflow walks staffer through emergency fills Urgent contracts now show a 4-step action plan: Step 1 (red): Review pre-matched workers Step 2 (yellow): Call first choice — highest match score Step 3 (blue): Confirm or replace — backup is ready Step 4 (green): Send shift details to confirmed workers First-choice worker highlighted with red border + label. Backup workers shown with dimmed styling + 'BACKUP' label. Urgent cards show ALL matched workers + backups (not just 3). Non-urgent contracts split into 'In Progress' (still filling) and 'Ready to Go' (fully staffed) sections. The staffer doesn't stare at a red label wondering what to do. They follow the steps: review, call, confirm, send. Done. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-17 16:08:18 -05:00
root	bb46869227	Intelligence-first UI: insights, not data dumps Complete rebuild around 'how did it know that?' moments: 1. NEEDS YOUR ATTENTION — urgent contracts with pre-matched workers. Each worker shows WHY they were matched: 'Reliable (85%) · Certified: OSHA-10 · Same city as job site' 2. READY TO CONFIRM — fully matched contracts, just review and send 3. YOUR STRONGEST WORKERS — 95%+ reliability, 'they rarely no-show and clients request them back' 4. BENCH STRENGTH ALERT — states with thin reliable worker pools, 'consider recruiting in these areas' Every section has: a label (ACTION NEEDED/READY/INSIGHT/HEADS UP), a headline in plain English, an explanation of HOW the system knows this, and actionable workers with Call/SMS buttons. This is what a CRM has never done: anticipate, explain, recommend. The staffer doesn't search — they respond to intelligence. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-17 15:46:25 -05:00
root	875cfadc3d	Graceful sparse data: show what exists, hide what doesn't Worker cards now handle sparse-to-rich data gracefully: - Name only? Shows name + 'New — data builds with placements' - Name + role? Shows name + role tag - Name + role + skills + certs? Shows full tag row - Has reliability data? Shows colored meter bars - No metrics? No empty bars, no 0% — just what's there Contract cards: urgency dot, progress bar, fill count. Workers inside: avatar initials, name, role, location, skill/cert tags (blue/green), archetype (purple), reliability/availability bars — all ONLY when data exists. GitHub-style dark theme. Call/SMS per worker. Search collapsed. ADR-021 compliant: works with a name and earns everything else. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-17 15:36:53 -05:00
root	845acfdcda	Rich worker cards: skills, certs, reliability bars — not just names Each worker in a contract card now shows: - Initials avatar (color-coded) - Name + location on same line - Skill tags (blue pills, top 3 relevant) - Cert badges (green pills — OSHA, Forklift, Hazmat) - Archetype tag (purple — reliable, leader, etc) - Reliability bar with color (green >80%, yellow >50%, red <50%) - Availability bar with color - Individual Call/SMS buttons per worker Contract headers show: - Urgency dot (red/yellow/blue/green) - Client name, role × headcount, location, start time - Progress bar with fill count GitHub-style dark theme. Every piece of info visible at a glance without clicking anything. The staffer sees skills, certs, and reliability for every matched worker the moment the page loads. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-17 15:27:27 -05:00
root	05785b4628	Dashboard: the staffer's actual workday, not a search box Not a CRM search page. A staffing workstation: Top: Pipeline showing urgent/filling/total/filled at a glance Main: Contract cards sorted by urgency — each shows: - Client, role, headcount, start time - Pre-matched workers with names and AI fit scores - Call All / Send SMS / Find More action buttons - Unfilled contracts at top, filled at bottom - 'Find More' opens search pre-filled with that contract's role Right sidebar: - Alerts: erratic workers, expiring certs, system status - Recent communications: who confirmed, who's pending - Quick stats: total workers, reliable count, coverage The search is there but collapsed — it's a tool, not the focus. When they open the page, their day is already organized. This is what the CRM doesn't do: anticipate, pre-match, organize. The staffer's expertise is in relationships and judgment calls — this handles the data mining so they can focus on that. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-17 15:22:18 -05:00
root	7cb9999451	Rebuild search UI: zero dependencies, plain JS, DOM-only, works Replaced complex dashboard with minimal search.html: - No external JS/CSS files, no transpilation, no module imports - Plain JS with .then() chains (no async/await compat issues) - DOM-only rendering via createElement (no innerHTML with data) - 20s AbortController timeout so fetch never hangs - Detects /lakehouse/ proxy prefix automatically - 7KB total, loads in 18ms Calls lakehouse /vectors/hybrid directly — SQL filters always apply, works even when HNSW isn't loaded (brute-force fallback). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-17 13:26:27 -05:00

42 Commits