Ingestion Runbook
Operational runbook for diagnosing and resolving issues with the news ingestion pipeline (fetch, deduplication, clustering, image resolution).
Note: Content is pending — this is a stub created from the runbook template.
Overview
This runbook covers the ingestion pipeline: news API fetching, article deduplication, story clustering, and image resolution.
Owner: ingest-engineer
Escalation Path: architect-steward → on-call engineer
Quick Reference
| Metric | Healthy | Warning | Critical |
|---|---|---|---|
| Ingestion queue depth | < 50 | 50-200 | > 200 |
| Cluster job latency | < 30s | 30-120s | > 120s |
| Image resolution rate | > 80% | 50-80% | < 50% |
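The thresholds above can be turned into a small classifier for use in alerts or dashboards. The following is a minimal sketch, not the pipeline's actual monitoring code: the metric keys, the `Status` enum, and the `status_for` helper are hypothetical names introduced for illustration. It assumes values that fall exactly on a boundary (for example a queue depth of 50 or 200) count as Warning, which matches the ranges in the table.

```python
from enum import Enum


class Status(Enum):
    HEALTHY = "healthy"
    WARNING = "warning"
    CRITICAL = "critical"


# (healthy_bound, critical_bound) per metric, copied from the Quick Reference table.
# The metric keys are hypothetical; use whatever names your metrics system exposes.
THRESHOLDS = {
    "ingestion_queue_depth": (50, 200),      # items; higher is worse
    "cluster_job_latency_s": (30, 120),      # seconds; higher is worse
    "image_resolution_rate_pct": (80, 50),   # percent; lower is worse
}


def classify(value: float, healthy_bound: float, critical_bound: float) -> Status:
    """Map a reading to a status.

    The direction is inferred from the bounds: if healthy_bound is below
    critical_bound, larger values are worse; otherwise smaller values are worse.
    Values on either boundary are treated as WARNING.
    """
    if healthy_bound < critical_bound:
        if value < healthy_bound:
            return Status.HEALTHY
        if value > critical_bound:
            return Status.CRITICAL
        return Status.WARNING
    if value > healthy_bound:
        return Status.HEALTHY
    if value < critical_bound:
        return Status.CRITICAL
    return Status.WARNING


def status_for(metric: str, value: float) -> Status:
    """Look up the bounds for a known metric and classify the reading."""
    healthy_bound, critical_bound = THRESHOLDS[metric]
    return classify(value, healthy_bound, critical_bound)
```

For example, `status_for("ingestion_queue_depth", 120)` falls in the Warning band, while `status_for("image_resolution_rate_pct", 45)` is Critical. Inferring the direction from the order of the two bounds keeps the table data in one place instead of adding a separate flag for each metric.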
Diagnostic Decision Tree
Is the ingestion pipeline processing?
├── No → Check worker-ingest health endpoint
│   ├── Worker down → Restart worker
│   └── Worker up → Check queue connectivity (Upstash Redis)
└── Yes → Check for specific failures
    ├── Articles not appearing → Check news API keys and rate limits
    ├── Clustering stuck → Check CLUSTER_STORIES queue
    └── Images failing → Check Unsplash/Pexels API keys
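The decision tree above can also be expressed as a small triage function, which may be useful in a chat-ops helper or an alert annotation. This is a sketch under stated assumptions: the `Observation` fields, the symptom strings, and `next_step` are hypothetical names, and the function only returns the next action from the tree. It does not call the health endpoint, Redis, or any news or image API itself.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Observation:
    """What the responder has observed so far (all field names are hypothetical)."""
    pipeline_processing: bool
    # Result of checking the worker-ingest health endpoint; only consulted
    # when the pipeline is not processing.
    worker_up: bool = True
    # One of "articles_missing", "clustering_stuck", "images_failing", or None.
    symptom: Optional[str] = None


# Symptom-to-action mapping, taken from the "Yes" branch of the tree.
SYMPTOM_ACTIONS = {
    "articles_missing": "Check news API keys and rate limits",
    "clustering_stuck": "Check CLUSTER_STORIES queue",
    "images_failing": "Check Unsplash/Pexels API keys",
}


def next_step(obs: Observation) -> str:
    """Return the next action from the diagnostic decision tree."""
    if not obs.pipeline_processing:
        if not obs.worker_up:
            return "Restart worker"
        return "Check queue connectivity (Upstash Redis)"
    # Pipeline is processing: fall back to the generic step when no
    # recognized symptom has been recorded yet.
    return SYMPTOM_ACTIONS.get(obs.symptom, "Check for specific failures")
```

As a usage example, `next_step(Observation(pipeline_processing=True, symptom="clustering_stuck"))` returns the CLUSTER_STORIES queue check. Keeping the symptom mapping in a dictionary means new entries can be added as issues are documented without changing the function.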
Common Issues
To be documented as issues are encountered.