Which benchmarks will GPT-5 be benchmarked against when it is announced?
9
Ṁ2365
Jan 1
98% SimpleQA
28% GSM8K
71% HumanEval
81% MMLU
84% GPQA
76% MATH
52% MGSM
32% DROP
52% Big-Bench-Hard
87% SWE-Bench

There is some flexibility on variations of specific benchmarks; e.g., SWE-Bench-Hard would resolve SWE-Bench YES.
