
Benchmark Gap #4: Once a single AI model solves >= 95% of miniF2F, MATH, and MMLU STEM, how many months will it be before an AI is listed as a (co) first author on a published math paper?
This question is meant to measure the gap between solving the main math benchmarks at the time of market creation and contributing to real-world mathematics.
The co-first-author requirement is loose: I will also accept an AI being credited with significant contributions to both deciding what to prove and the actual proof (merely contributing to the proof is not enough; I am trying to get at "the AI does the work of a mathematician," not "the AI does the work of a proof assistant"). I would also accept, for instance, the paper's human author stating that they would have named the AI as a coauthor if it were human, or that the result could not have been obtained without the AI's assistance.
Related questions
Benchmark Gap #5: Once a single AI model solves >= 95% of miniF2F, MATH, and MMLU STEM, will it be less than two years before AI models are used as entry-level data science / data analysis / statistics workers?
67% chance
Will any AI model achieve > 40% on Frontier Math before 2026?
69% chance
Will an AI co-author a mathematics research paper published in a reputable journal before the end of 2026?
25% chance
Will any AI model score >80% on Epoch's Frontier Math Benchmark in 2025?
10% chance
Will AI contribute as much as a co-author would today to a real research mathematics paper before Jan 1 2026?
25% chance
Benchmark Gap #3: Once a model achieves superhuman performance on a competitive programming benchmark, will it be less than 2 years before there are "entry level" AI programmers in industry use?
73% chance
Will an AI score over 80% on the FrontierMath benchmark in 2025?
10% chance
Will an AI achieve >85% performance on the FrontierMath benchmark before 2028?
59% chance
Which MATH-AI 23 works will have >50 Google Scholar citations by end of 2026?
Will an AI achieve >85% performance on the FrontierMath benchmark before 2027?
55% chance