lakehouse/tests/multi-agent/playbooks/scenario-2026-04-21T02-13-00/report.md

# Scenario retrospective — Midway Distribution, 2026-05-10

Executor: `qwen3.5:latest`   Reviewer: `qwen3:latest`   Draft: `qwen2.5:latest`   Overview(T3): `gpt-oss:20b`
Prior lessons loaded into executor context: **3** (from 2026-05-01, 2026-04-24, 2026-04-22)

## Events

| At | Kind | Role / Count | Pool | Fills | Turns | Dur(s) | Cites | Gaps |
|---|---|---|---|---|---|---|---|---|
| 11:00 | baseline_fill | Machine Operator × 2 | 1004 | ✓ 2 | 2 | 17.4 | 0 | 2 |
| 17:00 | expansion | Shipping Clerk × 2 | 389 | ✓ 2 | 2 | 11.7 | 0 | 3 |
| 09:30 | misplacement | Packer × 1 | - | ✗ 0 | 0 | 19.2 | 0 | 1 |

## Final roster

| Worker | Booked | Role | City, ST | Status |
|---|---|---|---|---|
| undefined Samantha S. Mendoza | 11:00 | Machine Operator | Chicago, IL | no_show |
| undefined Olivia E. Long | 11:00 | Machine Operator | Chicago, IL | confirmed |
| undefined Kyle K. Lee | 17:00 | Shipping Clerk | Chicago, IL | confirmed |
| undefined Ruth Murphy | 17:00 | Shipping Clerk | Chicago, IL | confirmed |

## Gap signals

### double_book
- **11:00** — undefined Olivia E. Long already booked for 11:00
- **17:00** — undefined Kyle K. Lee already booked for 11:00
- **17:00** — undefined Ruth Murphy already booked for 11:00

### artifact
- **11:00** — undefined is not an object (evaluating 'f.reason.slice')
- **17:00** — undefined is not an object (evaluating 'f.reason.slice')

### drift_or_tool
- **09:30** — aborted — 3 consecutive drift flags

### fairness
- _cross-event_ — Samantha S. Mendoza (undefined) booked 3 times today

### write_through_audit
- _post-run_ — playbook_memory has 1560 entries (ran 3 events, expected ≥ 2 new entries from this run)

## Workers touched across the week

5 distinct workers made it through to a decision. Every one is accounted for below — no-shows flagged, rebookings noted, everyone visible.

| Worker ID | Name | Events | Outcome |
|---|---|---|---|
| W500K-37224 | Samantha S. Mendoza | 11:00 baseline_fill | booked |
| W500K-38796 | Olivia E. Long | 11:00 baseline_fill | booked |
| W500K-47824 | Kyle K. Lee | 17:00 expansion | booked |
| W500K-7780 | Ruth Murphy | 17:00 expansion | booked |
| undefined | Samantha S. Mendoza | 11:00 | no_show |

## Discovered patterns (meta-index)

What the system identified across semantically-similar past fills as each event ran:

- **11:00 baseline_fill** (Machine Operator): Across 25 similar past playbooks (30 workers examined) · recurring certifications: OSHA-10 (47%) · archetype mostly: communicator · reliability median 0.79 (range 0.19–1.00)
- **17:00 expansion** (Shipping Clerk): Across 25 similar past playbooks (30 workers examined) · recurring certifications: OSHA-10 (47%) · archetype mostly: communicator · reliability median 0.79 (range 0.19–1.00)
- **09:30 misplacement** (Packer): —

## Narrative

- 2/3 events reached consensus.
- Final roster: 4 bookings across 1 distinct workers.
- Workers touched (booked, failed, or otherwise decided): 5.
- Playbook citations across the day: 0 (proof the feedback loop fired across events).
- Dropped events: 09:30 misplacement.