Fact Extraction Runbook
Operational runbook for diagnosing and resolving issues with AI fact extraction, evergreen generation, challenge content, and the seed pipeline.
Note: Content is pending — this is a stub created from the runbook template.
Overview
This runbook covers the fact extraction pipeline: AI-driven fact extraction from story clusters, evergreen fact generation, challenge content generation, and seed pipeline explosion.
Owner: fact-engineer
Escalation Path: architect-steward → on-call engineer
Quick Reference
| Metric | Healthy | Warning | Critical |
|---|---|---|---|
| Extraction queue depth | < 20 | 20-100 | > 100 |
| AI cost (daily) | < $3 | $3-5 | > $5 |
| Extraction success rate | > 90% | 70-90% | < 70% |
Diagnostic Decision Tree
Are facts being extracted?
├── No → Check worker-facts health endpoint
│ ├── Worker down → Restart worker
│ └── Worker up → Check EXTRACT_FACTS queue
│ ├── Queue empty → Check upstream clustering
│ └── Queue has messages → Check AI provider status
└── Yes → Check quality
├── Low notability scores → Review extraction prompts
├── Schema validation failures → Check fact_record_schemas
└── High AI costs → Check model tier routing
Common Issues
To be documented as issues are encountered.