Methodology

How we measure global internet censorship

Version 2.0•Updated: 2026-02-08•JSON

Overview

Composite score from three data sources, processed through federated ML pipeline, updates continuously based on live network measurements.

When a government blocks a new service, our data reflects the change within hours.

Data Sources

OONI Measurements

Samples12,486,133

Coverage50 countries

TestsWeb, Messaging, Circumvention

Sensor Network

Nodes16

Coverage6 continents

Interval30 seconds

User Telemetry

Users...+

RetentionAggregated only

PrivacyNo individual tracking

ML Model

Gradient boosting classifier trained on 37K labeled censorship incidents. Federated learning across 16 nodes ensures no raw user data leaves local systems.

Model Specifications

TypeXGBoost Binary Classifier

Versionv2_gradient_boosting

F1 Score0.998

AUC-ROC1.000

Features8

Training Samples12.5M+

ScheduleDaily @ 02:00 UTC

Feature Importance

country_censorship_score

28%

destination_blocked

22%

node_success_rate

18%

user_country_encoded

12%

node_load

hour_of_day

day_of_week

is_peak_hours

Scoring System

0-100 scale. 0 = complete freedom. 100 = total censorship.

0-10

Free

Minimal or no censorship

11-25

Low

Limited content restrictions

26-45

Medium

Significant restrictions on some platforms

46-70

High

Widespread blocking of platforms and news

71-100

Severe

Pervasive censorship / isolated internet

Limitations

⚠Scores are national averages — regional variations not captured
⚠VPN detection underreported in highly restricted environments
⚠Sample sizes vary by country — affects confidence levels
⚠Real-time events may take up to 24h to reflect in scores
⚠Content filtering and throttling harder to detect than blocking
⚠Self-censorship and legal restrictions not measured

Confidence Intervals

Each country score includes a confidence interval reflecting measurement certainty. Wider intervals indicate less data or greater variability.

Country

Score

Interval

Confidence

Note

China

66%

± 2%

high

Large sample

Iran

42%

± 4%

high

Russia

31%

± 3%

high

Myanmar

21%

± 7%

medium

Smaller sample

Validation

Scores are validated against external benchmarks and known censorship events. Continuous evaluation ensures model accuracy over time.

BaselineFreedom House — Freedom on the Net

Correlationr = 0.87

Ground TruthKnown events (e.g. Iran shutdowns match score spikes)

Cross Validation5-fold

F1 Score0.998

AUC-ROC1.000

Update Pipeline

OONI→Ingestion→Feature Engineering→ML Scoring→Index Update

TrainingDaily @ 02:00 UTC

PublicationDaily @ 03:00 UTC

Score Latency~24h

Raw Ingestion~5min

Citation

Use this data in research? Please cite:

APA Format

Voidly Research. (2026). Global Censorship Index. https://voidly.ai/censorship-index

BibTeX

@misc{voidly_censorship_index,
  author = {Voidly Research},
  title = {Global Censorship Index},
  year = {2026},
  url = {https://voidly.ai/censorship-index}
}

License: CC BY 4.0 — Free to use with attribution