voidly
SA · per-country backtest

Saudi Arabia forecast vs reality

Every Voidly Sentinel shutdown-risk forecast for Saudi Arabia, plotted against what actually happened. 28 (predicted, observed) pairs from the rolling 30-day evaluation window.

Updated every 30 min · CC BY 4.0 · Raw JSON · Current forecast →

Forecasts evaluated
28
since Apr 17
Accuracy @ 0.5
42.9%
12/28 correct
Brier score
0.492
lower is better
Observed positive rate
54%
mean predicted 4%

Forecast time series

Blue line: forecast probability. Green ✓ markers: forecast was right. Red ✗ markers: forecast was wrong. Dashed line at 0.5 is the binary decision threshold.

0.000.250.500.751.00threshold 0.50Apr 17May 14

Y axis: forecast probability · Faint tick: distance to observed outcome (0 or 1)

All 28 predictions (newest first)

Eval dateForecastPred ≥ 0.5?Observed?Correct?
May 143.4%no event
May 134.3%no event
May 125.1%no event
May 114.9%no event
May 104.0%no event
May 93.4%no event
May 83.2%no event
May 75.4%no event
May 64.6%no event
May 54.3%no event
May 43.3%no event
May 33.9%no event
May 24.8%shutdown
May 14.5%shutdown
Apr 304.6%shutdown
Apr 293.4%shutdown
Apr 284.6%shutdown
Apr 273.6%shutdown
Apr 263.5%shutdown
Apr 254.8%shutdown
Apr 244.1%shutdown
Apr 234.4%shutdown
Apr 224.0%shutdown
Apr 214.0%shutdown
Apr 203.9%shutdown
Apr 195.1%shutdown
Apr 183.8%shutdown
Apr 172.6%no event

How to read this

  • Each row is one historical forecast. We made the prediction at eval_date, then waited the 7-day horizon, then graded against the observed outcome.
  • Forecast % is the calibrated probability we published that day. Post-recalibration (2026-05-20) these now match actual observed rates much better — see the refit finding.
  • Pred ≥ 0.5 shows the binary alert decision. Note we usually fire alerts at a lower threshold (see /v1/sentinel/global_heatmap for the live cutoff) — the 0.5 column here is for backtest scoring consistency with the global confusion matrix.
  • Correct = both (pred ≥ 0.5) and (observed) agree.

Related