lakehouse/data/_catalog/manifests/e4b8441f-d729-4465-91fb-2ed5f481e65d.json
root 6a532cb248 Background job system for embedding — fixes 100K timeout
- JobTracker: create/update/complete/fail jobs with progress tracking
- POST /vectors/index now returns immediately with job_id (HTTP 202)
- Embedding runs in tokio::spawn background task
- GET /vectors/jobs/{id} returns live progress (chunks embedded, rate, ETA)
- GET /vectors/jobs lists all jobs
- Progress logged every 100 batches with chunks/sec and ETA
- 100K embedding job running successfully at 44 chunks/sec
- System stays responsive during embedding (queries in 23ms)

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-03-27 09:03:07 -05:00

15 lines
380 B
JSON

{
"id": "e4b8441f-d729-4465-91fb-2ed5f481e65d",
"name": "clients",
"schema_fingerprint": "auto",
"objects": [
{
"bucket": "data",
"key": "datasets/clients.parquet",
"size_bytes": 21971,
"created_at": "2026-03-27T14:00:35.670181596Z"
}
],
"created_at": "2026-03-27T14:00:35.670184688Z",
"updated_at": "2026-03-27T14:00:35.670184688Z"
}