MANIFOLD
Will OpenAI release a tokenizer with vocab size > 150k by end of 2024?
Mini
9
Ṁ246
Dec 31
42% chance
  1. The GPT-2 model used r50k_base: vocab size ≈ 50k (50,257 tokens)

  2. The GPT-3 model used r50k_base: vocab size ≈ 50k (50,257 tokens)

  3. The GPT-3.5 model used cl100k_base: vocab size ≈ 100k (100,277 tokens)

  4. The GPT-4 model used cl100k_base: vocab size ≈ 100k (100,277 tokens)
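
The comparison implied by the question title can be sketched in a few lines. This is only an illustration, not the market's official resolution logic: the names `KNOWN_TOKENIZERS`, `THRESHOLD`, `exceeds_threshold`, and `qualifying` are hypothetical, the sizes are the rounded figures from the list above, and reading "> 150k" as strictly greater than 150,000 is an assumption.

```python
# Rounded vocabulary sizes, as listed in the market description above.
KNOWN_TOKENIZERS = {
    "r50k_base": 50_000,     # GPT-2, GPT-3
    "cl100k_base": 100_000,  # GPT-3.5, GPT-4
}

# Assumption: "vocab size > 150k" means strictly more than 150,000 tokens.
THRESHOLD = 150_000


def exceeds_threshold(vocab_size: int, threshold: int = THRESHOLD) -> bool:
    """Return True if vocab_size is strictly greater than threshold."""
    return vocab_size > threshold


def qualifying(tokenizers: dict[str, int]) -> list[str]:
    """Return the names of tokenizers whose vocabulary exceeds the threshold."""
    return sorted(
        name for name, size in tokenizers.items() if exceeds_threshold(size)
    )


if __name__ == "__main__":
    for name, size in KNOWN_TOKENIZERS.items():
        verdict = "above" if exceeds_threshold(size) else "not above"
        print(f"{name}: {size:,} tokens, {verdict} {THRESHOLD:,}")
    print("Qualifying tokenizers:", qualifying(KNOWN_TOKENIZERS))
```

Neither tokenizer in the list clears the bar, so the question hinges on a new release. For a released tokenizer, the exact size could be read from the `n_vocab` attribute of an encoding loaded with `tiktoken.get_encoding(...)`, assuming the `tiktoken` package is installed and the encoding is published there.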

#AI
#OpenAI
#AI Impacts

Related questions

Will OpenAI reveal thinking tokens by the end of June 2025?
6% chance
Will the next major LLM by OpenAI use a new tokenizer?
77% chance
Will OpenAI release next-generation models with varying capabilities and sizes?
68% chance
Will OpenAI have a new name by the end of 2025?
5% chance
Will OpenAI release a tokenizer with more than 210000 tokens before 2026?
24% chance
Will a flagship (>60T training bytes) open-weights LLM from Meta which doesn't use a tokenizer be released in 2025?
20% chance
Will OpenAI release a version of Voice Engine by the end of 2024?
81% chance
OpenAI to release model weights by EOY?
84% chance
