Which Benchmarks will OpenAI show results from GPT-5 on, when it is announced?
➕
Plus
15
Ṁ3182
2026
94%
SimpleQA
87%
SWE-Bench
83%
GPQA
71%
HumanEval
67%
ARC-AGI-2
48%
MMLU
38%
MATH
35%
Big-Bench-Hard
31%
MGSM
19%
DROP
8%
GSM8K

Some flexibility on variations of specific benchmarks. eg SWE-Bench-Hard would resolve SWE-Bench YES.

  • Update 2025-05-11 (PST) (AI summary of creator comment): The benchmarks must be those that GPT-5 is benchmarked against by OpenAI.

Must be on roughly the same day / during / around the time of the announcement. If there are several announcements over multiple days, all those times are acceptable for the purpose of this market.

Get
Ṁ1,000
and
S1.00
Sort by:
bought Ṁ10 SimpleQA NO

you mean benchmarked by OpenAI?

@bbb I can't add options, I might create a duplicate where i can in a bit.

bought Ṁ30 MATH NO

@bbb Idk if i was actually able to change the settings back then but since then ive learned how to do it, so added arc agi 2