Will OpenAI's next-generation model score 65% or higher on the GPQA benchmark?
Plus
13
Ṁ803resolved Sep 16
Resolved
YES1D
1W
1M
ALL
Resolve to YES if OpenAI's next generation language model scores 65% or higher on the GPQA benchmark(extended set).
If OpenAI's existing model gets 65% or higher by post-training enhancements, that also counts.
There's room for improvement via prompt engineering after the release, but I don't know how long I should wait, so I will resolve this question as soon as OpenAI releases their next model.
Get
1,000and
1.00
Related questions
Related questions
Will OpenAI's o4 get above 50% on humanity's last exam?
31% chance
Will OpenAI claim that it has achieved AGI in 2025?
12% chance
Will OpenAI's next major LLM (after GPT-4) surpass 70% accuracy on the GPQA benchmark?
66% chance
Will OpenAI's next major LLM (after GPT-4) surpass 74% accuracy on the GPQA benchmark?
85% chance
Will OpenAI's next major LLM (after GPT-4) achieve over 50% resolution rate on the SWE-bench benchmark?
75% chance
OpenAI's next major AI model will be more open than GPT-4 by June 30, 2025
Will OpenAI models achieve ≥90% on SimpleBench by the end of 2025?
41% chance
Will AI image generating models score >= 90% on Winoground by June 1, 2025?
77% chance
Will any AI model score above 95% on GRAB by the end of 2025?
42% chance
Will any AI model score >80% on Epoch's Frontier Math Benchmark in 2025?
19% chance