Will a new deep learning paradigm replace the transformer by the end of 2024?
Ṁ1395 · Jan 1 · 8% chance

Will a new neural architecture, or an entirely different machine learning method, displace the query-key-value attention-based architectures currently dominant in large language models? Or will large language models (and, more generally, foundation models across modalities) continue to scale up transformers? Fundamentally, this new method must not employ layers of self-attention or cross-attention, yet must show scaling laws more promising than those of transformer-based LLMs. It must be commonly recognized by practitioners as superior to transformer methods, and multiple state-of-the-art open- and closed-source models must employ it. From the invention of the transformer, it took a few years for it to become universally adopted; however, with the current attention on foundation models, adoption of a better approach should be significantly swifter.
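For reference, the mechanism the question rules out — query, key, value attention — can be sketched minimally. This is an illustrative NumPy sketch of standard scaled dot-product attention; the shapes and toy data are assumptions, not part of the market description:

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """softmax(Q K^T / sqrt(d_k)) V — the core op of transformer LLMs."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)               # (n_q, n_k) similarity logits
    scores -= scores.max(axis=-1, keepdims=True)  # subtract max for numerical stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)  # rows sum to 1
    return weights @ V                            # weighted mix of value vectors

# Toy example: 3 query tokens attending over 3 key/value tokens of width 4.
rng = np.random.default_rng(0)
Q, K, V = (rng.standard_normal((3, 4)) for _ in range(3))
out = scaled_dot_product_attention(Q, K, V)
print(out.shape)  # (3, 4)
```

The cost that motivates replacement candidates is visible here: the `Q @ K.T` step compares every token with every other token, so compute and memory grow quadratically with sequence length.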


Personally I have a bit of faith in this concept:

https://arxiv.org/abs/2312.00752
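That link is Mamba, a selective state-space model. At its core is a linear recurrence rather than attention; below is a hypothetical minimal sketch of that backbone recurrence (Mamba additionally makes the parameters input-dependent, i.e. "selective" — not shown here), with illustrative names and toy values:

```python
import numpy as np

def ssm_scan(A, B, C, u):
    """Linear state-space recurrence: x_t = A x_{t-1} + B u_t, y_t = C x_t.
    Runs in O(sequence length) with constant-size state, unlike attention's
    quadratic all-pairs token comparison."""
    x = np.zeros(A.shape[0])
    ys = []
    for u_t in u:                 # sequential form; real models use a parallel scan
        x = A @ x + B * u_t       # state update
        ys.append(C @ x)          # readout
    return np.array(ys)

# Toy example: 2-dim state decaying by 0.9 per step, scalar input impulse.
A = np.eye(2) * 0.9
B = np.ones(2)
C = np.ones(2)
y = ssm_scan(A, B, C, [1.0, 0.0, 0.0])
print(y)  # [2.   1.8  1.62] — the impulse response decays geometrically
```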

predicts NO

@Ebcc1 Definitely on my radar!

predicts YES

@Supermaxman What do you think about it and other possibilities?

predicts NO

@Ebcc1 Watching to see how peers receive it at ICLR: https://openreview.net/forum?id=AL1fq05o7H