Which AI predicts the 2026 World Cup better?
Claude, Gemini and OpenAI each predict all 72 group-stage matches. As the real results come in, every correct call scores points. The Brier score measures how well each model's probabilities are calibrated — lower is better.
Group stage progress0/72 matches
API, no tools + standardized context block (same data for every model).
Claude
0
points · 72 predictions submitted
Exact scores
0
Correct results
0
Points accuracy
0.0%
Result accuracy
0.0%
Brier (calibration ↓)
—
Scored matches
0
Gemini
0
points · 72 predictions submitted
Exact scores
0
Correct results
0
Points accuracy
0.0%
Result accuracy
0.0%
Brier (calibration ↓)
—
Scored matches
0
OpenAI
0
points · 72 predictions submitted
Exact scores
0
Correct results
0
Points accuracy
0.0%
Result accuracy
0.0%
Brier (calibration ↓)
—
Scored matches
0
How scoring works
- +5Exact score
- +3Correct result + one exact side
- +2Correct result only (W/D/L)
- +0Wrong result
Experiment design
See the prompts →- Web— Chat + live web access — free sourcing.
- Baseline— API, no tools — internal model knowledge only.
- Enriched— API, no tools + standardized context block (same data for every model).