lakehouse/mcp-server/samples/staffing_roster_sample.csv
root 23eb04a145 Onboarding wizard — ingest any staffing CSV in 3 steps
New /onboard page. Client-facing wizard for getting real data into
the system without engineering help.

Flow:
1. Drop a CSV (or click 'Use the sample as my data' — ships a 25-row
   realistic staffing roster under /samples/staffing_roster_sample.csv)
2. Browser parses client-side. Columns auto-typed (text/int/decimal/
   date). PII flagged by name hint AND content regex (emails, phones).
   First rows previewed. Read-only — nothing written yet.
3. Name the dataset (lowercase+underscores). Commit.
4. Post-commit: dataset is live. Shows 4 next steps the operator can
   take (SQL query, vector index, dashboard search, playbook training).

Backend:
- /onboard serves onboard.html
- /samples/*.csv serves CSV files from mcp-server/samples/ with
  filename validation (only [a-zA-Z0-9_-.]+.csv, prevents path traversal)
- /onboard/ingest forwards multipart/form-data to gateway /ingest/file
  preserving the boundary. The generic /api/* passthrough breaks
  multipart because it reads as text and forwards as JSON; this route
  uses arrayBuffer + original Content-Type.

Verified end-to-end: upload sample roster (25 rows, 12 columns) →
parse in browser → show columns + PII flags + preview → commit →
gateway writes Parquet, registers in catalog → immediately queryable:
  SELECT * FROM onboard_demo2 LIMIT 3
  → Sarah Johnson, Forklift Operator, Chicago, IL, 0.92
Round-trip <1 second.

Nav updated on all pages to link Onboard. Shipped with a sample CSV
so the full flow is demonstrable without real client data.

When a real client shows up, same path — they upload their CSV.
No engineering ticket, no code change, no schema pre-definition.

Security: sample filename regex prevents path traversal. CSV parse
is client-side pure JS (no DOM injection). Commit uses existing
/ingest/file validation (schema fingerprint, PII server-side,
content-hash dedup).
2026-04-20 18:13:56 -05:00

3.8 KiB

1worker_idnamerolecitystateemailphoneskillscertificationsavailabilityreliabilityarchetype
2W-1001Sarah JohnsonForklift OperatorChicagoILsarah.johnson@example.com(312) 555-0101pallet jack|hazmat|loading dockOSHA-10|Forklift0.920.88reliable
3W-1002Michael ChenWelderToledoOHm.chen@example.com(419) 555-0234TIG|MIG|blueprint reading|grinderOSHA-10|AWS D1.10.710.94specialist
4W-1003Maria RodriguezWarehouse AssociateChicagoILmaria.r@example.com(312) 555-0311inventory|RF scanner|pick-to-lightOSHA-100.850.80reliable
5W-1004David ParkMachine OperatorDetroitMIdpark@example.com(313) 555-0412CNC|gauge R&R|lean manufacturingOSHA-10|First Aid0.600.91specialist
6W-1005Jennifer LopezLoaderChicagoILj.lopez@example.com(312) 555-0501loading dock|team lead|cold storageOSHA-10|Forklift0.950.76communicator
7W-1006Robert WilliamsMaintenance TechMilwaukeeWIrwilliams@example.com(414) 555-0612electrical|PLC|hydraulics|CMMSOSHA-10|LOTO0.450.93specialist
8W-1007Amanda TaylorQuality TechToledoOHa.taylor@example.com(419) 555-0701ISO 9001|calibration|root cause analysisOSHA-10|Six Sigma Green0.800.87leader
9W-1008Carlos MendozaElectricianChicagoILc.mendoza@example.com(312) 555-0808conduit|motor controls|troubleshootingOSHA-30|NEC0.880.96specialist
10W-1009Kim NguyenForklift OperatorDetroitMIk.nguyen@example.com(313) 555-0915pallet jack|shipping|team leadOSHA-10|Forklift0.730.82flexible
11W-1010James O'BrienWelderMilwaukeeWIjobrien@example.com(414) 555-1021TIG|pipe welding|confined spaceOSHA-10|AWS D1.10.900.79reliable
12W-1011Priya PatelProduction WorkerChicagoILpriya.p@example.com(312) 555-1118line work|first article|labelingOSHA-100.660.85reliable
13W-1012Thomas AndersonMaterial HandlerToledoOHtanderson@example.com(419) 555-1225RF scanner|pallet jack|receivingOSHA-10|Forklift0.820.77flexible
14W-1013Lisa NakamuraWarehouse AssociateDetroitMIl.nakamura@example.com(313) 555-1312inventory|Excel|packagingOSHA-100.770.89communicator
15W-1014Brandon MooreMachine OperatorMilwaukeeWIb.moore@example.com(414) 555-1405CNC|SPC|conveyor opsOSHA-100.560.94specialist
16W-1015Emily ZhangAssemblerChicagoILemily.z@example.com(312) 555-1502assembly|gauge R&R|line leadOSHA-10|Six Sigma Yellow0.910.81leader
17W-1016Marcus JohnsonForklift OperatorToledoOHmjohnson@example.com(419) 555-1609pallet jack|hazmat|overhead craneOSHA-10|Forklift|Hazmat0.680.88reliable
18W-1017Anita BrooksLine LeadDetroitMIa.brooks@example.com(313) 555-1715team lead|training|SPCOSHA-30|Six Sigma Green0.520.97leader
19W-1018Dmitri VolkovWelderChicagoILd.volkov@example.com(312) 555-1820TIG|MIG|blueprint readingOSHA-10|AWS D1.1|Confined Space0.840.83specialist
20W-1019Rachel KimLoaderMilwaukeeWIr.kim@example.com(414) 555-1911loading dock|first aid|bilingualOSHA-10|First Aid0.790.84communicator
21W-1020Samuel ParkMaintenance TechToledoOHspark@example.com(419) 555-2012electrical|PLC|troubleshooting|CMMSOSHA-10|LOTO0.700.92specialist
22W-1021Jordan WilliamsQuality TechDetroitMIj.williams@example.com(313) 555-2117ISO 9001|calibration|SPCOSHA-10|Six Sigma Green0.830.86leader
23W-1022Natalia SotoWarehouse AssociateChicagoILn.soto@example.com(312) 555-2221RF scanner|inventory|team leadOSHA-100.890.90communicator
24W-1023Henry ChenMachine OperatorMilwaukeeWIh.chen@example.com(414) 555-2318CNC|conveyor ops|root causeOSHA-100.640.93specialist
25W-1024Ava MartinezForklift OperatorToledoOHa.martinez@example.com(419) 555-2411pallet jack|bilingual|cold storageOSHA-10|Forklift0.870.80flexible
26W-1025Tyler RodriguezProduction WorkerChicagoILt.rodriguez@example.com(312) 555-2515line work|packaging|quality inspectionOSHA-100.740.86reliable