voidly
← Back to Censorship Index

Methodology

How we measure global internet censorship

Version 2.0Updated: 2026-03-25JSON

Overview

Composite score from multiple measurement networks, processed through our ML pipeline, updates continuously based on live network measurements.

When a government blocks a new service, our data reflects the change within hours.

Data Sources

OONI Measurements

Samples25,342,690
Coverage127 countries
TestsWeb, Messaging, Circumvention

Sensor Network

Nodes37
Coverage6 continents
Probes (24h)~7,000+

External Sources

IODAInternet outage detection (ASN-level)
CensoredPlanetRemote DNS/HTTP blocking (50 countries)
Citizen LabDomain categorization (14K+ domains)

Multi-Source Correlation

No single measurement network captures the full picture of internet censorship. OONI provides active probing but has geographic gaps. CensoredPlanet provides remote measurement but lacks ground truth. IODA detects outages but not selective blocking.

Voidly operates its own 37-node network across 6 continents — testing VPN accessibility and censorship patterns every 5 minutes — then correlates these proprietary measurements with three external measurement networks (OONI, CensoredPlanet, IODA) to produce verified incidents with evidence chains. This turns ambiguous network anomalies into structured, citable censorship intelligence.

ML Model

Gradient boosting classifier trained on 37K labeled censorship incidents. Privacy-preserving training on aggregate data only — no raw user data is used.

Censorship Classifier

AlgorithmGradientBoosting
F1 Score99.8% F1 (internal eval)
AUC-ROC1.000 AUC (internal eval)
Training Samples37K labeled incidents
ScheduleWeekly (Sundays @ 02:00 UTC)

Shutdown Forecast Model

AlgorithmXGBoost
ROC AUC74.6%
Recall50%
Training Samples14.6K historical records
ScheduleWeekly (Sundays @ 02:00 UTC)

Feature Importance (Classifier)

country_risk_tier
85%
anomaly_rate
6%
measurement_count
4%
source_diversity
3%
temporal_pattern
2%

Importance values from GradientBoosting model trained on 37K labeled incidents.

Scoring System

0-100 scale. 0 = complete freedom. 100 = total censorship.

0-10
Free
Minimal or no censorship
11-25
Low
Limited content restrictions
26-45
Medium
Significant restrictions on some platforms
46-70
High
Widespread blocking of platforms and news
71-100
Severe
Pervasive censorship / isolated internet

Limitations

  • Scores are national averages — regional variations not captured
  • VPN detection underreported in highly restricted environments
  • Sample sizes vary by country — affects confidence levels
  • Real-time events may take up to 24h to reflect in scores
  • Content filtering and throttling harder to detect than blocking
  • Self-censorship and legal restrictions not measured

Confidence Intervals

Each country score includes a confidence interval reflecting measurement certainty. Wider intervals indicate less data or greater variability.

Country
Score (illustrative)
Interval
Confidence
Note
Country A
66%
± 2%
high
Large sample
Country B
42%
± 4%
high
Country C
31%
± 3%
high
Country D
21%
± 7%
medium
Smaller sample

Scores shown are illustrative examples from a point-in-time snapshot. Live scores update continuously on the Censorship Index.

Validation

Scores are validated against external benchmarks and known censorship events. Continuous evaluation ensures model accuracy over time.

BaselineFreedom House — Freedom on the Net
Correlationr = 0.87 (vs. Freedom House FOTN, self-reported)
Ground TruthKnown events (e.g. Iran shutdowns match score spikes)
Cross Validation5-fold
Classifier F199.8% F1 (internal eval)
Classifier AUC-ROC1.000 AUC (internal eval)
Forecast AUC-ROC74.6% (internal eval)

Metrics are from internal 5-fold cross-validation. No independent third-party evaluation has been conducted. Published tools and data are available for independent replication.

Update Pipeline

OONIIngestionFeature EngineeringML ScoringIndex Update
Classifier RetrainWeekly (Sundays @ 02:00 UTC)
PublicationDaily @ 03:00 UTC
Score Latency~6h (aggregated), ~5min (probes)
Raw IngestionEvery 5min (probes), every 6h (OONI/IODA/CensoredPlanet)

Citation

Use this data in research? Please cite:

APA Format

Voidly Research. (2026). Global Censorship Index. https://voidly.ai/censorship-index

BibTeX

@misc{voidly_censorship_index,
  author = {Voidly Research},
  title = {Global Censorship Index},
  year = {2026},
  url = {https://voidly.ai/censorship-index}
}

License: CC BY 4.0 — Free to use with attribution

Data Access

Contact

Researchresearch@voidly.ai
Partnershipspartnerships@voidly.ai
Generalteam@voidly.ai