← Back to Censorship Index

Methodology

How we measure global internet censorship

Version 2.0Updated: 2026-02-08JSON

Overview

Composite score from three data sources, processed through federated ML pipeline, updates continuously based on live network measurements.

When a government blocks a new service, our data reflects the change within hours.

Data Sources

OONI Measurements

Samples12,486,133
Coverage50 countries
TestsWeb, Messaging, Circumvention

Sensor Network

Nodes16
Coverage6 continents
Interval30 seconds

User Telemetry

Users...+
RetentionAggregated only
PrivacyNo individual tracking

ML Model

Gradient boosting classifier trained on 37K labeled censorship incidents. Federated learning across 16 nodes ensures no raw user data leaves local systems.

Model Specifications

TypeXGBoost Binary Classifier
Versionv2_gradient_boosting
F1 Score0.998
AUC-ROC1.000
Features8
Training Samples12.5M+
ScheduleDaily @ 02:00 UTC

Feature Importance

country_censorship_score
28%
destination_blocked
22%
node_success_rate
18%
user_country_encoded
12%
node_load
8%
hour_of_day
6%
day_of_week
4%
is_peak_hours
2%

Scoring System

0-100 scale. 0 = complete freedom. 100 = total censorship.

0-10
Free
Minimal or no censorship
11-25
Low
Limited content restrictions
26-45
Medium
Significant restrictions on some platforms
46-70
High
Widespread blocking of platforms and news
71-100
Severe
Pervasive censorship / isolated internet

Limitations

  • Scores are national averages — regional variations not captured
  • VPN detection underreported in highly restricted environments
  • Sample sizes vary by country — affects confidence levels
  • Real-time events may take up to 24h to reflect in scores
  • Content filtering and throttling harder to detect than blocking
  • Self-censorship and legal restrictions not measured

Confidence Intervals

Each country score includes a confidence interval reflecting measurement certainty. Wider intervals indicate less data or greater variability.

Country
Score
Interval
Confidence
Note
China
66%
± 2%
high
Large sample
Iran
42%
± 4%
high
Russia
31%
± 3%
high
Myanmar
21%
± 7%
medium
Smaller sample

Validation

Scores are validated against external benchmarks and known censorship events. Continuous evaluation ensures model accuracy over time.

BaselineFreedom House — Freedom on the Net
Correlationr = 0.87
Ground TruthKnown events (e.g. Iran shutdowns match score spikes)
Cross Validation5-fold
F1 Score0.998
AUC-ROC1.000

Update Pipeline

OONIIngestionFeature EngineeringML ScoringIndex Update
TrainingDaily @ 02:00 UTC
PublicationDaily @ 03:00 UTC
Score Latency~24h
Raw Ingestion~5min

Citation

Use this data in research? Please cite:

APA Format

Voidly Research. (2026). Global Censorship Index. https://voidly.ai/censorship-index

BibTeX

@misc{voidly_censorship_index,
  author = {Voidly Research},
  title = {Global Censorship Index},
  year = {2026},
  url = {https://voidly.ai/censorship-index}
}

License: CC BY 4.0 — Free to use with attribution

Data Access

Contact