Will the next major LLM by OpenAI use a new tokenizer?
Plus
44
Ṁ12732025
77%
chance
1D
1W
1M
ALL
The GPT-2 model used r50k_base: vocab size = 50k
The GPT-3 model used r50k_base: vocab size = 50k
The GPT-3.5 model used cl100k_base: vocab size = 100k
The GPT-4 model used cl100k_base: vocab size = 100k
Get
1,000
and1.00
Sort by:
@firstuserhere So YES if there's a GPT-4.5/5 that uses a tokeniser not on this list, and NO if there's a GPT-4.5/5 that uses a tokeniser that is on this list?
Related questions
Related questions
Will OpenAI reveal a textless LLM before 2025?
19% chance
Will the next LLM released by OpenAI be worse than GPT-4 at MMLU?
16% chance
What is the next OpenAI LLM logo color?
Will OpenAI's next major LLM release support video input?
55% chance
Will OpenAI release a tokenizer with vocab size > 150k by end of 2024?
42% chance
When will OpenAI release a more capable LLM?
Will openAI have the most accurate LLM across most benchmarks by EOY 2024?
37% chance
Will OpenAI release an LLM moderation tool in 2024?
67% chance
Will OpenAI have the best LLM in 2024?
71% chance
Will a flagship (>60T training bytes) open-weights LLM from Meta which doesn't use a tokenizer be released in 2025?
43% chance