Tutorial: Monitor citation churn on a medical/scientific page
Goal
Track citation additions and removals on a high-traffic medical Wikipedia page to detect sourcing changes over time.
Steps
1. Analyze a medical page
refract analyze "COVID-19" --depth detailed -c
2. Export citation data as CSV
refract export "COVID-19" --format csv
This produces a CSV with citation URLs, add/remove timestamps, and revision IDs — ready for spreadsheet analysis.
3. Pipe to file
refract export "COVID-19" --format csv > churn-report.csv
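Before building a spreadsheet around the report, it can help to confirm which columns the export actually contains. The sketch below is not part of Refract. It assumes only that the file is a standard CSV with a header row, and the function name describe_columns is made up for illustration.

```python
import csv


def describe_columns(lines):
    """Return (column_names, data_row_count) for CSV text with a header row.

    `lines` is any iterable of CSV text lines; an open file object works.
    """
    reader = csv.DictReader(lines)
    row_count = sum(1 for _ in reader)  # consumes the data rows
    # fieldnames is None when the input is completely empty
    return list(reader.fieldnames or []), row_count


# Usage sketch (assumes churn-report.csv was produced by step 3):
#
#     with open("churn-report.csv", newline="", encoding="utf-8") as f:
#         columns, rows = describe_columns(f)
#     print(columns, rows)
```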
Use case: identifying biased sourcing
Citation churn patterns can reveal unstable sourcing. A citation added in one edit and removed in the next suggests that the source is contested. A replacement, such as a journal article swapped for a news article, suggests that editors are revising their evidence base. Churn concentrated in one page section may point to a content dispute.
Example output
{
  "eventId": "d1e3f5a7b9c2048a",
  "eventType": "citation_added",
  "fromRevisionId": 1280090010,
  "toRevisionId": 1280090100,
  "section": "Vaccine efficacy",
  "before": "",
  "after": "<ref>{{cite journal |last1=Smith |title=...}}</ref>",
  "timestamp": "2024-11-20T10:00:00Z",
  "layer": "observed",
  "deterministicFacts": [
    {
      "fact": "Citation added in section Vaccine efficacy",
      "provenance": {
        "analyzer": "citation-tracker",
        "version": "0.3.1",
        "inputHashes": []
      }
    }
  ]
}
{
  "eventId": "2b4d6f8a0c1e3059",
  "eventType": "citation_removed",
  "fromRevisionId": 1280090100,
  "toRevisionId": 1280090200,
  "section": "Vaccine efficacy",
  "before": "<ref>{{cite journal |last1=Smith |title=...}}</ref>",
  "after": "",
  "timestamp": "2024-11-21T14:30:00Z",
  "layer": "observed",
  "deterministicFacts": [
    {
      "fact": "Citation removed in section Vaccine efficacy",
      "provenance": {
        "analyzer": "citation-tracker",
        "version": "0.3.1",
        "inputHashes": []
      }
    }
  ]
}
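The two events above form an add-then-remove pair: the removal starts from the revision where the addition ended, and it removes the same citation text. A minimal sketch of how such pairs could be flagged follows. It is not part of Refract. It assumes the events are already parsed into Python dicts with the field names shown above and arrive in chronological order, and find_quick_reversals is a hypothetical helper name.

```python
def find_quick_reversals(events):
    """Pair each citation_added event with the citation_removed event
    that undoes it in the immediately following revision step.

    Assumptions (not guaranteed by Refract): events are dicts shaped like
    the example output and are ordered oldest first.
    """
    pending = {}    # (revision the add landed in, citation text) -> add event
    reversals = []  # list of (added_event, removed_event) tuples
    for event in events:
        if event["eventType"] == "citation_added":
            pending[(event["toRevisionId"], event["after"])] = event
        elif event["eventType"] == "citation_removed":
            key = (event["fromRevisionId"], event["before"])
            if key in pending:
                reversals.append((pending.pop(key), event))
    return reversals
```

Matching on the exact citation text is deliberately strict: a citation that was reworded before removal will not be paired, which keeps false positives low at the cost of missing some reversals.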
Notes
- Use --format csv or --format ndjson on refract export for spreadsheet-importable churn reports.
- Group by source domain to see which types of sources churn most.
- Refract tracks citation changes, not source quality; judging quality is outside its boundary.
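Grouping by source domain can be done in a spreadsheet, or with a few lines of Python. The sketch below is an illustration under assumptions, not Refract functionality: the column name citation_url is hypothetical (the tutorial does not list the CSV headers), so pass the real header name via url_column.

```python
import csv
from collections import Counter
from urllib.parse import urlparse


def churn_by_domain(lines, url_column="citation_url"):
    """Count churn rows per source domain.

    `lines` is any iterable of CSV text lines; an open file object works.
    `url_column` is a HYPOTHETICAL header name; replace it with the one
    that actually appears in the export.
    """
    counts = Counter()
    for row in csv.DictReader(lines):
        url = (row.get(url_column) or "").strip()
        host = urlparse(url).hostname  # None for empty or scheme-less URLs
        if not host:
            continue  # skip rows without a parseable absolute URL
        if host.startswith("www."):
            host = host[4:]  # treat www.example.org and example.org alike
        counts[host] += 1
    return counts


# Usage sketch (assumes churn-report.csv from step 3):
#
#     with open("churn-report.csv", newline="", encoding="utf-8") as f:
#         for domain, n in churn_by_domain(f).most_common(10):
#             print(domain, n)
```

The counts describe how often citations to a domain change, not whether the domain is reliable.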