Which AI predicts the 2026 World Cup better?

Claude, Gemini and OpenAI each predict all 72 group-stage matches. As the real results come in, every correct call scores points. The Brier score measures how well each model's probabilities are calibrated — lower is better.

Group stage progress0/72 matches

API, no tools — internal model knowledge only.

Claude

0
points · 72 predictions submitted
Exact scores
0
Correct results
0
Points accuracy
0.0%
Result accuracy
0.0%
Brier (calibration ↓)
Scored matches
0

Gemini

0
points · 72 predictions submitted
Exact scores
0
Correct results
0
Points accuracy
0.0%
Result accuracy
0.0%
Brier (calibration ↓)
Scored matches
0

OpenAI

0
points · 72 predictions submitted
Exact scores
0
Correct results
0
Points accuracy
0.0%
Result accuracy
0.0%
Brier (calibration ↓)
Scored matches
0

How scoring works

  • +5Exact score
  • +3Correct result + one exact side
  • +2Correct result only (W/D/L)
  • +0Wrong result

Experiment design

See the prompts →
  • WebChat + live web access — free sourcing.
  • BaselineAPI, no tools — internal model knowledge only.
  • EnrichedAPI, no tools + standardized context block (same data for every model).