Methodology β Ukraine War Analytics Data Collection
How we collect, process, and present conflict data for the Russia-Ukraine war.
1Data Collection
Our data is aggregated from multiple open-source intelligence (OSINT) providers. Each source uses different methodologies:
ACLED Event Data
Events are coded by trained researchers from news reports, social media, and local sources. Each event is geolocated and categorized by type.
Oryx Equipment Losses
Every loss is verified with photographic or video evidence. Items are categorized as destroyed, damaged, abandoned, or captured.
Official Sources
Government statistics from Ukraine, international organizations, and verified news agencies provide additional data points.
βData Pipeline Overview
Raw Sources
ACLED, Oryx, UNHCR, NASA FIRMS, World Bank
Collection
TypeScript fetch scripts run on GitHub Actions
Processing
Normalise, deduplicate, geolocate, aggregate
Validation
Schema checks, outlier detection, QA rules
Publication
Next.js static build β Vercel CDN delivery
2Data Processing
Raw data undergoes several processing steps before presentation:
- βNormalization: Data from different sources is standardized to common formats and categories.
- βDeduplication: Events reported by multiple sources are merged to avoid double-counting.
- βGeolocation: Events are mapped to administrative regions (oblasts) for geographic analysis.
- βAggregation: Daily, weekly, and monthly summaries are calculated from individual events.
3Event Classification
Events are classified using the ACLED taxonomy:
Battles
Armed clashes between organized military groups. Includes ground combat, territorial changes, and military offensives.
Explosions/Remote Violence
Artillery, missiles, airstrikes, and drone attacks. Events where violence is delivered from a distance.
Violence Against Civilians
Attacks targeting civilian populations, including war crimes, forced disappearances, and deliberate attacks on infrastructure.
Strategic Developments
Significant non-violent events including treaties, territorial agreements, and major political developments.
πOryx Equipment Verification Methodology
Equipment losses are tracked by Oryx, an independent OSINT team that applies a strict visual-evidence standard:
1. Visual Evidence Required
Every single loss entry requires at least one unique photo or video frame showing the damaged or destroyed vehicle. Text reports or official statements alone are insufficient.
2. Per-Vehicle Tracking
Each vehicle is tracked individually by model, markings, and image URL. The same vehicle cannot be counted twice β duplicate images are detected and excluded.
3. Status Categories
Destroyed: vehicle visually confirmed as a total loss (burned out, blown up, structurally destroyed). Damaged: visible damage but possibly repairable. Abandoned: intact vehicle left by crew. Captured: seized by the opposing side.
4. Conservative Undercount
Because losses without photographic evidence are excluded, Oryx figures are a confirmed minimum. Independent analysts estimate real losses are 20β40% higher than Oryx-tracked numbers, particularly for losses in Russian-controlled territory.
πHow Casualty Figures Are Counted
Casualty counting in an active conflict is extremely difficult. We use multiple sources with different methodologies:
π₯ OHCHR Civilian Casualties
The UN Human Rights Monitoring Mission verifies each civilian casualty individually. Their figures are conservative β representing confirmed cases where there was "reasonable grounds to believe" a civilian was killed or injured. The UN states real numbers are "considerably higher."
βοΈ Military Casualties
Military casualty estimates come from official Ukrainian and Western government statements, and ACLED fatality coding. All parties underreport their own losses for morale and strategic reasons. Figures should be treated as rough orders-of-magnitude, not precise counts.
π° ACLED Fatality Coding
ACLED researchers code fatality estimates from news reports, assigning best-estimate ranges to each event. These are not verified individually but represent the best available synthesis of open-source reporting.
β οΈ Key Caveats
We never publish raw official Russian or Ukrainian military casualty announcements as ground truth. All figures displayed are sourced from independent third-party organizations and are labeled as estimates where appropriate.
!Limitations & Caveats
Important limitations to consider when using our data:
- β οΈUnderreporting: Not all events are captured. Remote areas and active combat zones have limited reporting.
- β οΈFog of War: Initial reports may be inaccurate and are updated as more information becomes available.
- β οΈVerification Delays: Equipment losses require visual evidence, causing reporting delays.
- β οΈCasualty Estimates: True casualty figures are unknown. Official figures from all parties should be treated with caution.
- β οΈGeographic Bias: More events are reported in areas with better media access and communication infrastructure.
4Update Schedule
Daily
Event data updates
Weekly
Summary reports
Continuous
Equipment verification