lakehouse/tests/multi-agent/playbooks/scenario-2026-04-21T04-00-00/report.md

# Scenario retrospective — Midway Distribution — Joliet DC Ramp, 2026-04-26

Executor: `qwen3.5:latest`   Reviewer: `qwen3:latest`   Draft: `qwen2.5:latest`   Overview(T3): `gpt-oss:120b`
Prior lessons loaded into executor context: **3** (from 2026-05-10, 2026-05-02, 2026-05-01)

## Events

| At | Kind | Role / Count | Pool | Fills | Turns | Dur(s) | Cites | Gaps |
|---|---|---|---|---|---|---|---|---|
| 07:00 | baseline_fill | Warehouse Associate × 5 | 821 | ✓ 5 | 2 | 16.5 | 5 | 4 |
| 10:00 | recurring | Forklift Operator × 3 | 1178 | ✓ 3 | 2 | 14.1 | 8 | 3 |
| 12:30 | expansion | Picker × 4 | - | ✗ 0 | 0 | 23.5 | 0 | 1 |
| 15:00 | misplacement | Forklift Operator × 1 | 1178 | ✓ 1 | 3 | 17.3 | 9 | 1 |

## Final roster

| Worker | Booked | Role | City, ST | Status |
|---|---|---|---|---|
| undefined Mary H. King | 07:00 | Warehouse Associate | Joliet, IL | no_show |
| undefined Emily E. Powell | 07:00 | Warehouse Associate | Joliet, IL | confirmed |
| undefined Nathan Y. Miller | 07:00 | Warehouse Associate | Joliet, IL | confirmed |
| undefined Andre Davis | 07:00 | Warehouse Associate | Joliet, IL | confirmed |
| undefined Kelly L. Nelson | 07:00 | Warehouse Associate | Joliet, IL | confirmed |
| undefined Joshua Taylor | 10:00 | Forklift Operator | Joliet, IL | confirmed |
| undefined Lauren F. Adams | 10:00 | Forklift Operator | Joliet, IL | confirmed |
| undefined Joyce D. Reyes | 10:00 | Forklift Operator | Joliet, IL | confirmed |
| undefined Joshua Taylor | 15:00 | Forklift Operator | Joliet, IL | confirmed |

## Gap signals

### double_book
- **07:00** — undefined Emily E. Powell already booked for 07:00
- **07:00** — undefined Nathan Y. Miller already booked for 07:00
- **07:00** — undefined Andre Davis already booked for 07:00
- **07:00** — undefined Kelly L. Nelson already booked for 07:00
- **10:00** — undefined Joshua Taylor already booked for 07:00
- **10:00** — undefined Lauren F. Adams already booked for 07:00
- **10:00** — undefined Joyce D. Reyes already booked for 07:00
- **15:00** — undefined Joshua Taylor already booked for 07:00

### drift_or_tool
- **12:30** — aborted — 3 consecutive drift flags

### fairness
- _cross-event_ — Mary H. King (undefined) booked 8 times today

### write_through_audit
- _post-run_ — playbook_memory has 1695 entries (ran 4 events, expected ≥ 3 new entries from this run)

## Workers touched across the week

10 distinct workers made it through to a decision. Every one is accounted for below — no-shows flagged, rebookings noted, everyone visible.

| Worker ID | Name | Events | Outcome |
|---|---|---|---|
| W500K-37707 | Mary H. King | 07:00 baseline_fill | booked |
| W500K-23612 | Emily E. Powell | 07:00 baseline_fill | booked |
| W500K-16430 | Nathan Y. Miller | 07:00 baseline_fill | booked |
| W500K-2245 | Andre Davis | 07:00 baseline_fill | booked |
| W500K-23801 | Kelly L. Nelson | 07:00 baseline_fill | booked |
| W500K-2372 | Joshua Taylor | 10:00 recurring | booked |
| W500K-46701 | Lauren F. Adams | 10:00 recurring | booked |
| W500K-41031 | Joyce D. Reyes | 10:00 recurring | booked |
| 2372 | Joshua Taylor | 15:00 misplacement | booked |
| undefined | Mary H. King | 07:00 | no_show |

## Discovered patterns (meta-index)

What the system identified across semantically-similar past fills as each event ran:

- **07:00 baseline_fill** (Warehouse Associate): Across 25 similar past playbooks (20 workers examined) · recurring certifications: Forklift (40%) · archetype mostly: leader · reliability median 0.80 (range 0.34–1.00)
- **10:00 recurring** (Forklift Operator): Across 25 similar past playbooks (20 workers examined) · recurring certifications: Forklift (40%) · archetype mostly: reliable · reliability median 0.80 (range 0.34–1.00)
- **12:30 expansion** (Picker): —
- **15:00 misplacement** (Forklift Operator): Across 25 similar past playbooks (20 workers examined) · recurring certifications: Forklift (40%) · archetype mostly: leader · reliability median 0.80 (range 0.34–1.00)

## Narrative

- 3/4 events reached consensus.
- Final roster: 9 bookings across 1 distinct workers.
- Workers touched (booked, failed, or otherwise decided): 10.
- Playbook citations across the day: 22 (proof the feedback loop fired across events).
- Dropped events: 12:30 expansion.