Will Claude 3.5 Opus have a higher Chat Arena Elo than GPT-5?
Mini
26
Ṁ1020Jan 2
6% chance
Note that if OpenAI ends up changing its naming scheme to something else, I will count it if the model appears to be the one mentioned in this blog post: https://openai.com/index/openai-board-forms-safety-and-security-committee/
Additionally, this market will only resolve YES if 3.5 Opus sustains its position above GPT-5 for the entire first month after both models are listed on the leaderboard, so if it passes GPT-5 only temporarily due to noise, it will not count.
Related questions
Will Claude 3.5 Opus beat OpenAI's best released model on the arena.lmsys.org leaderboard?
32% chance
Will Claude 3.5 Opus be able to draw me in tic-tac-toe while playing as O at least 1/3 of the time?
61% chance
Will any LLM outrank GPT-4 by 150 Elo in LMSYS chatbot arena before 2025?
20% chance
What will Claude 3.5 Opus's reported 0-shot performance on GPQA Diamond be upon release?
What will be the *first* ELO Rating of Claude 3.5 Opus in the LMSYS Arena?
Will there be a model with a 69%+ Chatbot Arena win rate against gpt-o1 before June 1st, 2025?
47% chance
Will Claude Opus be ranked in the top 20 on the Chatbot Arena Leaderboard two years from today (3/10/24)?
13% chance
Will Claude 3.5 Opus be available via API by end of 2025?
70% chance
Will an Open Source LLM Surpass any GPT-4 model in Elo Rating on Chatbot Arena on December 31, 2024?
96% chance
Will any open-source model rank in the top 3 on Chatbot Arena at any point in 2024? (resolves based on ELO rating)
4% chance