MANIFOLD
Open-source OpenAI model beats Grok 4 on LMArena?
21
Ṁ4058
Aug 9
6% chance

Measured by Arena Score without style control on https://lmarena.ai/?leaderboard. The open-source OpenAI model considered for this market will be the most powerful one released before August 9, a month after the release of Grok 4.

#AI
#Technical AI Timelines
#OpenAI
#LLMs
#Grok 4

Related questions

Will Grok 4 Top the Chatbot Leaderboard?
26% chance (-27% 1d)
Will Grok 4 achieve over 69% on SimpleBench
36% chance (-14% 1d)
What is Grok 4's performance on METR's task length evaluation?
When will xAI release Grok 4 (or Grok 3.5)
-
Will OpenAI's next major LLM (after GPT-4) surpass 70% accuracy on the GPQA benchmark?
75% chance
Will Grok 3.5 Top the Chatbot Leaderboard?
1% chance
Llama 5 outperforms GPT 4o on LM Arena?
51% chance (+3% 1d)
Will Claude 3.5 Opus beat OpenAI's best released model on the arena.lmsys.org leaderboard?
9% chance
Will OpenAI's next major LLM (after GPT-4) surpass 74% accuracy on the GPQA benchmark?
86% chance
When will xAI release Grok 4?
