TR · per-country backtest
Turkey forecast vs reality
Every Voidly Sentinel shutdown-risk forecast for Turkey, plotted against what actually happened. 28 (predicted, observed) pairs from the rolling 30-day evaluation window.
Updated every 30 min · CC BY 4.0 · Raw JSON · Current forecast →
Forecasts evaluated
28
since Apr 17
Accuracy @ 0.5
32.1%
9/28 correct
Brier score
0.517
lower is better
Observed positive rate
57%
mean predicted 6%
Forecast time series
Blue line: forecast probability. Green ✓ markers: forecast was right. Red ✗ markers: forecast was wrong. Dashed line at 0.5 is the binary decision threshold.
Y axis: forecast probability · Faint tick: distance to observed outcome (0 or 1)
All 28 predictions (newest first)
| Eval date | Forecast | Pred ≥ 0.5? | Observed? | Correct? |
|---|---|---|---|---|
| May 14 | 6.9% | ↑ | no event | ✗ |
| May 13 | 8.5% | ↑ | no event | ✗ |
| May 12 | 7.1% | ↑ | no event | ✗ |
| May 11 | 8.2% | ↑ | no event | ✗ |
| May 10 | 5.9% | ↑ | no event | ✗ |
| May 9 | 7.9% | ↑ | no event | ✗ |
| May 8 | 6.1% | ↑ | no event | ✗ |
| May 7 | 6.1% | ↑ | no event | ✗ |
| May 6 | 6.8% | ↑ | no event | ✗ |
| May 5 | 7.0% | ↑ | no event | ✗ |
| May 4 | 5.3% | ↑ | no event | ✗ |
| May 3 | 6.3% | ↑ | no event | ✗ |
| May 2 | 6.6% | ↑ | shutdown | ✓ |
| May 1 | 6.5% | ↑ | shutdown | ✓ |
| Apr 30 | 7.0% | ↑ | shutdown | ✓ |
| Apr 29 | 7.0% | ↑ | shutdown | ✓ |
| Apr 28 | 7.0% | ↑ | shutdown | ✓ |
| Apr 27 | 5.2% | ↑ | shutdown | ✓ |
| Apr 26 | 6.6% | ↑ | shutdown | ✓ |
| Apr 25 | 5.6% | ↑ | shutdown | ✓ |
| Apr 24 | 4.6% | — | shutdown | ✗ |
| Apr 23 | 2.6% | — | shutdown | ✗ |
| Apr 22 | 3.3% | — | shutdown | ✗ |
| Apr 21 | 4.0% | — | shutdown | ✗ |
| Apr 20 | 3.2% | — | shutdown | ✗ |
| Apr 19 | 3.5% | — | shutdown | ✗ |
| Apr 18 | 5.2% | ↑ | shutdown | ✓ |
| Apr 17 | 3.7% | — | shutdown | ✗ |
How to read this
- Each row is one historical forecast. We made the prediction at eval_date, then waited the 7-day horizon, then graded against the observed outcome.
- Forecast % is the calibrated probability we published that day. Post-recalibration (2026-05-20) these now match actual observed rates much better — see the refit finding.
- Pred ≥ 0.5 shows the binary alert decision. Note we usually fire alerts at a lower threshold (see /v1/sentinel/global_heatmap for the live cutoff) — the 0.5 column here is for backtest scoring consistency with the global confusion matrix.
- Correct = both (pred ≥ 0.5) and (observed) agree.
Related
- /atlas/forecast/tr — current 7-day calibrated forecast for Turkey with SHAP drivers
- /tr — Turkey country page (current state, recent incidents)
- /sentinel/backtest — global reliability diagram + per-country comparison table
- Calibration refit writeup