Gemini 3.0 Pro outperforms GPT-5 on METR 50% time horizon?
Ṁ20k · Dec 30
81% chance
The 50% time horizon is a measure of AI autonomy based on the length of tasks an AI system can complete: roughly, it is the time humans take to complete tasks that the AI system succeeds at 50% of the time. See METR's "Measuring AI Ability to Complete Long Tasks" for the technical definition. In that paper, Claude 3.7 Sonnet, released in February 2025, was the leading model, with a 50% time horizon of 59 minutes.
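To make the rough definition concrete, here is a minimal sketch of one way to estimate a 50% time horizon: fit a logistic curve of success against the log of human task time, then read off the task length where the fitted success probability crosses 50%. This is a simplification and not METR's actual pipeline. The function names and the run data are hypothetical, invented for illustration only.

```python
import math


def fit_logistic(xs, ys, iterations=25):
    """Fit P(success) = sigmoid(a + b * x) by Newton's method; return (a, b)."""
    a, b = 0.0, 0.0
    for _ in range(iterations):
        g_a = g_b = h_aa = h_ab = h_bb = 0.0
        for x, y in zip(xs, ys):
            p = 1.0 / (1.0 + math.exp(-(a + b * x)))
            w = p * (1.0 - p)
            # Gradient of the log-likelihood.
            g_a += y - p
            g_b += (y - p) * x
            # Entries of the (negated) Hessian.
            h_aa += w
            h_ab += w * x
            h_bb += w * x * x
        det = h_aa * h_bb - h_ab * h_ab
        if det < 1e-12:  # degenerate fit, e.g. perfectly separable data
            break
        # Newton step: solve the 2x2 system H * delta = gradient.
        a += (h_bb * g_a - h_ab * g_b) / det
        b += (h_aa * g_b - h_ab * g_a) / det
    return a, b


def time_horizon_50(runs):
    """Return the task length in minutes at which fitted success is 50%.

    `runs` is a list of (human_minutes, succeeded) pairs.
    """
    xs = [math.log2(minutes) for minutes, _ in runs]
    ys = [1.0 if succeeded else 0.0 for _, succeeded in runs]
    a, b = fit_logistic(xs, ys)
    # sigmoid(a + b * x) = 0.5 exactly when a + b * x = 0.
    return 2.0 ** (-a / b)


# Hypothetical data: successes out of 4 attempts per human task length (minutes).
successes_out_of_4 = {1: 4, 2: 4, 4: 3, 8: 2, 16: 1, 32: 0, 64: 0}
runs = [
    (minutes, attempt < wins)
    for minutes, wins in successes_out_of_4.items()
    for attempt in range(4)
]

horizon = time_horizon_50(runs)
print(f"{horizon:.1f} minutes")  # prints "8.0 minutes"
```

The made-up data is symmetric around the 8-minute tasks on a log scale, so the fitted curve crosses 50% at 8 minutes. Real evaluations involve many more tasks and additional statistical care, so the paper remains the reference for the actual method.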

See also:
Related questions
Opus 4.5's METR time horizon beats Gemini 3.0 Pro's?
81% chance
Gemini 3's METR 50% time horizon
Will the Gemini 3.0 Pro METR score release in 2025?
61% chance
Before 2026, will Gemini 3.0 exceed GPT-5 in Metr estimated time horizon?
77% chance
Will GPT-5.1 have a longer METR time horizon than Gemini 3?
21% chance
Will GPT-5.2 surpass 3 hours in METR time horizon?
74% chance
Opus 4.5's METR time horizon beats GPT-5.1's?
80% chance
Gemini 3's execution time-horizon?
-
Will GPT-5.2's METR 50% time horizon exceed 3 hours 30 minutes?
30% chance