source_id
source_url
crawl_time
docs/25-ingestion-architecture.md
docs/28-crawling-and-sync.md
docs/26-multi-db-spec.md