voidly
SD · per-country backtest

Sudan forecast vs reality

Every Voidly Sentinel shutdown-risk forecast for Sudan, plotted against what actually happened. 28 (predicted, observed) pairs from the rolling 30-day evaluation window.

Updated every 30 min · CC BY 4.0 · Raw JSON · Current forecast →

Forecasts evaluated
28
since Apr 17
Accuracy @ 0.5
67.9%
19/28 correct
Brier score
0.229
lower is better
Observed positive rate
25%
mean predicted 4%

Forecast time series

Blue line: forecast probability. Green ✓ markers: forecast was right. Red ✗ markers: forecast was wrong. Dashed line at 0.5 is the binary decision threshold.

0.000.250.500.751.00threshold 0.50Apr 17May 14

Y axis: forecast probability · Faint tick: distance to observed outcome (0 or 1)

All 28 predictions (newest first)

Eval dateForecastPred ≥ 0.5?Observed?Correct?
May 144.2%no event
May 134.6%no event
May 123.1%no event
May 114.2%no event
May 104.0%no event
May 94.1%no event
May 84.8%no event
May 75.2%no event
May 64.7%no event
May 54.6%no event
May 45.1%no event
May 34.5%no event
May 24.8%shutdown
May 14.5%shutdown
Apr 305.1%shutdown
Apr 294.5%shutdown
Apr 284.2%shutdown
Apr 274.0%shutdown
Apr 263.4%no event
Apr 253.1%no event
Apr 244.2%no event
Apr 235.3%no event
Apr 223.9%no event
Apr 214.6%no event
Apr 204.5%no event
Apr 193.0%no event
Apr 184.8%no event
Apr 174.6%shutdown

How to read this

  • Each row is one historical forecast. We made the prediction at eval_date, then waited the 7-day horizon, then graded against the observed outcome.
  • Forecast % is the calibrated probability we published that day. Post-recalibration (2026-05-20) these now match actual observed rates much better — see the refit finding.
  • Pred ≥ 0.5 shows the binary alert decision. Note we usually fire alerts at a lower threshold (see /v1/sentinel/global_heatmap for the live cutoff) — the 0.5 column here is for backtest scoring consistency with the global confusion matrix.
  • Correct = both (pred ≥ 0.5) and (observed) agree.

Related