ET · per-country backtest
Ethiopia forecast vs reality
Every Voidly Sentinel shutdown-risk forecast for Ethiopia, plotted against what actually happened. 28 (predicted, observed) pairs from the rolling 30-day evaluation window.
Updated every 30 min · CC BY 4.0
| Metric | Value | Note |
|---|---|---|
| Forecasts evaluated | 28 | since Apr 17 |
| Accuracy @ 0.5 | 75.0% | 21/28 correct |
| Brier score | 0.809 | lower is better |
| Observed positive rate | 93% | mean predicted 7% |
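The summary figures can be recomputed from the prediction table on this page. The sketch below is a minimal illustration, not Voidly's scoring code: the `rows` list is transcribed by hand from the table (forecast percent, then 1 for shutdown and 0 for no event), and all variable names are hypothetical. It uses the standard Brier score definition, the mean squared difference between the forecast probability and the 0/1 outcome.

```python
# (forecast %, observed) pairs transcribed from the table, newest first.
# observed: 1 = shutdown, 0 = no event.
rows = [
    (8.7, 0), (7.4, 1), (7.4, 1), (8.1, 1), (7.4, 1), (5.8, 1), (9.5, 1),
    (8.3, 1), (9.3, 1), (10.5, 1), (8.5, 1), (7.4, 1), (5.6, 1), (6.3, 1),
    (9.1, 1), (8.5, 1), (6.6, 1), (5.8, 1), (6.4, 1), (6.6, 1), (5.1, 1),
    (4.5, 1), (3.8, 1), (3.6, 1), (3.5, 1), (4.3, 1), (4.3, 1), (3.5, 0),
]

n = len(rows)
probs = [pct / 100 for pct, _ in rows]   # convert percent to probability
outcomes = [obs for _, obs in rows]

# Brier score: mean squared error between probability and 0/1 outcome.
brier = sum((p - o) ** 2 for p, o in zip(probs, outcomes)) / n
positive_rate = sum(outcomes) / n        # share of rows with a shutdown
mean_predicted = sum(probs) / n          # average published probability

print(f"n = {n}")                                     # n = 28
print(f"Brier score: {brier:.3f}")                    # Brier score: 0.809
print(f"Observed positive rate: {positive_rate:.0%}") # 93%
print(f"Mean predicted: {mean_predicted:.0%}")        # 7%
```

A Brier score near 0.81 is what the arithmetic gives when forecasts of roughly 7% meet an outcome that occurred in 26 of 28 cases: each missed shutdown contributes about (1 - 0.07)² ≈ 0.86 to the mean.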
Forecast time series
Blue line: forecast probability. Green ✓ markers: forecast was right. Red ✗ markers: forecast was wrong. Dashed line at 0.5 is the binary decision threshold.
Y axis: forecast probability · Faint tick: distance to observed outcome (0 or 1)
All 28 predictions (newest first)
| Eval date | Forecast | Pred ≥ 0.5? | Observed? | Correct? |
|---|---|---|---|---|
| May 14 | 8.7% | ↑ | no event | ✗ |
| May 13 | 7.4% | ↑ | shutdown | ✓ |
| May 12 | 7.4% | ↑ | shutdown | ✓ |
| May 11 | 8.1% | ↑ | shutdown | ✓ |
| May 10 | 7.4% | ↑ | shutdown | ✓ |
| May 9 | 5.8% | ↑ | shutdown | ✓ |
| May 8 | 9.5% | ↑ | shutdown | ✓ |
| May 7 | 8.3% | ↑ | shutdown | ✓ |
| May 6 | 9.3% | ↑ | shutdown | ✓ |
| May 5 | 10.5% | ↑ | shutdown | ✓ |
| May 4 | 8.5% | ↑ | shutdown | ✓ |
| May 3 | 7.4% | ↑ | shutdown | ✓ |
| May 2 | 5.6% | ↑ | shutdown | ✓ |
| May 1 | 6.3% | ↑ | shutdown | ✓ |
| Apr 30 | 9.1% | ↑ | shutdown | ✓ |
| Apr 29 | 8.5% | ↑ | shutdown | ✓ |
| Apr 28 | 6.6% | ↑ | shutdown | ✓ |
| Apr 27 | 5.8% | ↑ | shutdown | ✓ |
| Apr 26 | 6.4% | ↑ | shutdown | ✓ |
| Apr 25 | 6.6% | ↑ | shutdown | ✓ |
| Apr 24 | 5.1% | ↑ | shutdown | ✓ |
| Apr 23 | 4.5% | — | shutdown | ✗ |
| Apr 22 | 3.8% | — | shutdown | ✗ |
| Apr 21 | 3.6% | — | shutdown | ✗ |
| Apr 20 | 3.5% | — | shutdown | ✗ |
| Apr 19 | 4.3% | — | shutdown | ✗ |
| Apr 18 | 4.3% | — | shutdown | ✗ |
| Apr 17 | 3.5% | — | no event | ✓ |
How to read this
- Each row is one historical forecast. We made the prediction on eval_date, waited out the 7-day horizon, then graded it against the observed outcome.
- Forecast % is the calibrated probability we published that day. Post-recalibration (2026-05-20), these probabilities match actual observed rates much better; see the refit finding.
- Pred ≥ 0.5 shows the binary alert decision. Note that we usually fire alerts at a lower threshold (see /v1/sentinel/global_heatmap for the live cutoff); the 0.5 column here keeps backtest scoring consistent with the global confusion matrix.
- Correct means the binary prediction (pred ≥ 0.5) agrees with the observed outcome: an alert followed by a shutdown, or no alert followed by no event.
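The grading rule described in this list can be sketched in a few lines. This is an illustration under stated assumptions, not the production grader: `grading_date` and `is_correct` are hypothetical helper names, the year 2026 is assumed from the recalibration date, and the alert flags are copied from the table's "Pred ≥ 0.5?" column rather than recomputed from the Forecast column.

```python
from datetime import date, timedelta

HORIZON_DAYS = 7  # forecast horizon stated in the text


def grading_date(eval_date: date, horizon_days: int = HORIZON_DAYS) -> date:
    """Earliest date on which a forecast made on eval_date can be graded."""
    return eval_date + timedelta(days=horizon_days)


def is_correct(alert: bool, shutdown_observed: bool) -> bool:
    """A row counts as correct when the binary alert and the outcome agree."""
    return alert == shutdown_observed


# Flags as shown in the table, newest first (May 14 back to Apr 17).
alerts = [True] * 21 + [False] * 7          # 21 "↑" rows, then 7 "—" rows
observed = [False] + [True] * 26 + [False]  # no event, 26 shutdowns, no event

n_correct = sum(is_correct(a, o) for a, o in zip(alerts, observed))
accuracy = n_correct / len(alerts)

print(grading_date(date(2026, 5, 14)))  # 2026-05-21 (year is an assumption)
print(f"{n_correct}/{len(alerts)} correct, accuracy {accuracy:.1%}")  # 21/28 correct, accuracy 75.0%
```

Counting agreement this way reproduces the 21/28 figure in the summary: 20 alerts followed by a shutdown, plus the Apr 17 row with no alert and no event.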
Related
- /atlas/forecast/et — current 7-day calibrated forecast for Ethiopia with SHAP drivers
- /et — Ethiopia country page (current state, recent incidents)
- /sentinel/backtest — global reliability diagram + per-country comparison table
- Calibration refit writeup