Ingestion Runbook

Operational runbook for diagnosing and resolving issues with the news ingestion pipeline (fetch, cluster, image resolution).

Note: Content is pending — this is a stub created from the runbook template.


Overview

This runbook covers the ingestion pipeline: news API fetching, article deduplication, story clustering, and image resolution.

Owner: ingest-engineer

Escalation Path: architect-steward → on-call engineer


Quick Reference

Metric                  Healthy  Warning   Critical
Ingestion queue depth   < 50     50-200    > 200
Cluster job latency     < 30s    30-120s   > 120s
Image resolution rate   > 80%    50-80%    < 50%
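
The thresholds in the Quick Reference table can be encoded so that a metric reading maps to a status automatically. The following is a minimal sketch, not the pipeline's actual alerting code: the metric key names, the `Threshold` class, and the `classify` function are hypothetical, and boundary values (for example a queue depth of exactly 50 or 200) are assumed to fall in the warning band, as the table's ranges suggest.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Threshold:
    warning: float          # value at which the metric leaves the healthy band
    critical: float         # value beyond which the metric is critical
    higher_is_worse: bool = True


# Numbers copied from the Quick Reference table; key names are assumptions.
THRESHOLDS = {
    "ingestion_queue_depth": Threshold(warning=50, critical=200),
    "cluster_job_latency_s": Threshold(warning=30, critical=120),
    "image_resolution_rate_pct": Threshold(
        warning=80, critical=50, higher_is_worse=False
    ),
}


def classify(metric: str, value: float) -> str:
    """Return 'healthy', 'warning', or 'critical' for a metric reading."""
    t = THRESHOLDS[metric]
    if t.higher_is_worse:
        if value > t.critical:
            return "critical"
        if value >= t.warning:
            return "warning"
        return "healthy"
    # Lower values are worse (e.g. image resolution rate).
    if value < t.critical:
        return "critical"
    if value <= t.warning:
        return "warning"
    return "healthy"
```

For example, `classify("ingestion_queue_depth", 120)` falls in the warning band, while `classify("image_resolution_rate_pct", 45)` is critical because that metric degrades as it drops.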

Diagnostic Decision Tree

Is the ingestion pipeline processing?
├── No → Check worker-ingest health endpoint
│   ├── Worker down → Restart worker
│   └── Worker up → Check queue connectivity (Upstash Redis)
└── Yes → Check for specific failures
    ├── Articles not appearing → Check news API keys and rate limits
    ├── Clustering stuck → Check CLUSTER_STORIES queue
    └── Images failing → Check Unsplash/Pexels API keys
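
The decision tree above can also be written as a small lookup function, which may be useful for on-call tooling or a chat bot. This is an illustrative sketch only: the function name `next_step`, its parameters, and the symptom keys are hypothetical, and the returned strings are the actions from the tree itself.

```python
from typing import Optional

# Symptom keys are assumptions; the actions are taken from the decision tree.
SYMPTOM_ACTIONS = {
    "articles_missing": "Check news API keys and rate limits",
    "clustering_stuck": "Check CLUSTER_STORIES queue",
    "images_failing": "Check Unsplash/Pexels API keys",
}


def next_step(
    processing: bool,
    worker_up: Optional[bool] = None,
    symptom: Optional[str] = None,
) -> str:
    """Walk the diagnostic decision tree and return the next action."""
    if not processing:
        if worker_up is None:
            # Worker state not yet known: start with the health endpoint.
            return "Check worker-ingest health endpoint"
        if not worker_up:
            return "Restart worker"
        return "Check queue connectivity (Upstash Redis)"
    # Pipeline is processing: narrow down by observed symptom.
    if symptom is None:
        return "Check for specific failures"
    return SYMPTOM_ACTIONS.get(symptom, "Check for specific failures")
```

Calling `next_step(processing=False)` returns the first check, and passing `worker_up` once it is known advances to the matching branch.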

Common Issues

To be documented as issues are encountered.