ID · per-country backtest
Indonesia forecast vs reality
Every Voidly Sentinel shutdown-risk forecast for Indonesia, plotted against what actually happened. 28 (predicted, observed) pairs from the rolling 30-day evaluation window.
Updated every 30 min · CC BY 4.0 · Raw JSON · Current forecast →
Forecasts evaluated
28
since Apr 17
Accuracy @ 0.5
32.1%
9/28 correct
Brier score
0.627
lower is better
Observed positive rate
68%
mean predicted 4%
Forecast time series
Blue line: forecast probability. Green ✓ markers: forecast was right. Red ✗ markers: forecast was wrong. Dashed line at 0.5 is the binary decision threshold.
Y axis: forecast probability · Faint tick: distance to observed outcome (0 or 1)
All 28 predictions (newest first)
| Eval date | Forecast | Pred ≥ 0.5? | Observed? | Correct? |
|---|---|---|---|---|
| May 14 | 4.9% | — | no event | ✓ |
| May 13 | 3.4% | — | no event | ✓ |
| May 12 | 5.0% | ↑ | no event | ✗ |
| May 11 | 4.8% | — | no event | ✓ |
| May 10 | 4.6% | — | no event | ✓ |
| May 9 | 4.9% | — | no event | ✓ |
| May 8 | 3.9% | — | no event | ✓ |
| May 7 | 2.3% | — | no event | ✓ |
| May 6 | 4.0% | — | no event | ✓ |
| May 5 | 4.8% | — | shutdown | ✗ |
| May 4 | 3.4% | — | shutdown | ✗ |
| May 3 | 3.1% | — | shutdown | ✗ |
| May 2 | 4.2% | — | shutdown | ✗ |
| May 1 | 4.0% | — | shutdown | ✗ |
| Apr 30 | 4.3% | — | shutdown | ✗ |
| Apr 29 | 4.9% | — | shutdown | ✗ |
| Apr 28 | 3.8% | — | shutdown | ✗ |
| Apr 27 | 3.2% | — | shutdown | ✗ |
| Apr 26 | 5.4% | ↑ | shutdown | ✓ |
| Apr 25 | 2.7% | — | shutdown | ✗ |
| Apr 24 | 4.6% | — | shutdown | ✗ |
| Apr 23 | 4.7% | — | shutdown | ✗ |
| Apr 22 | 3.0% | — | shutdown | ✗ |
| Apr 21 | 4.4% | — | shutdown | ✗ |
| Apr 20 | 4.6% | — | shutdown | ✗ |
| Apr 19 | 2.6% | — | shutdown | ✗ |
| Apr 18 | 3.5% | — | shutdown | ✗ |
| Apr 17 | 4.0% | — | shutdown | ✗ |
How to read this
- Each row is one historical forecast. We made the prediction at eval_date, then waited the 7-day horizon, then graded against the observed outcome.
- Forecast % is the calibrated probability we published that day. Post-recalibration (2026-05-20) these now match actual observed rates much better — see the refit finding.
- Pred ≥ 0.5 shows the binary alert decision. Note we usually fire alerts at a lower threshold (see /v1/sentinel/global_heatmap for the live cutoff) — the 0.5 column here is for backtest scoring consistency with the global confusion matrix.
- Correct = both (pred ≥ 0.5) and (observed) agree.
Related
- /atlas/forecast/id — current 7-day calibrated forecast for Indonesia with SHAP drivers
- /id — Indonesia country page (current state, recent incidents)
- /sentinel/backtest — global reliability diagram + per-country comparison table
- Calibration refit writeup