Is code-davinci-002 just the largest non-GPT-4 model in the GPT-4 scaling law experiment?
Feb 17 · 44% chance

GPT-4 Technical Report: https://arxiv.org/pdf/2303.08774.pdf


Resolves YES/NO once I have >90% confidence either way.
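
For context, the "GPT-4 scaling law experiment" refers to the prediction in the linked report: OpenAI fit a power law with an irreducible loss term, L(C) = aC^b + c, to the final loss of smaller models trained with the same methodology, then extrapolated it to GPT-4's compute. A minimal sketch of that kind of fit, using invented placeholder data (the report's underlying numbers are not public):

```python
import numpy as np
from scipy.optimize import curve_fit

# The linked GPT-4 report fits final loss as a power law in compute with
# an irreducible term, L(C) = a * C**b + c, on smaller models, then
# extrapolates to GPT-4's compute (normalized here so GPT-4 = 1).
def loss_law(c, a, b, irr):
    return a * np.power(c, b) + irr

# Invented placeholder points (generated from a=1.5, b=-0.05, irr=1.0);
# these are NOT the report's data, which was never released.
compute = np.array([1e-10, 1e-8, 1e-6, 1e-4])
loss = np.array([5.743, 4.768, 3.993, 3.377])

(a, b, irr), _ = curve_fit(loss_law, compute, loss,
                           p0=(1.0, -0.1, 1.0), maxfev=10000)
print(f"extrapolated loss at GPT-4 compute: {loss_law(1.0, a, b, irr):.2f}")  # ~2.5
```

The question is whether the largest of those smaller models is code-davinci-002.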

1 Comment

The largest non-GPT-4 model in that scaling plot used 10,000x (4 orders of magnitude) less compute than GPT-4, while GPT-3 was only ~2 OOMs less compute!
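
A quick back-of-the-envelope check of that arithmetic: GPT-3's training compute (~3.14e23 FLOPs) is reported in the GPT-3 paper, but GPT-4's compute has never been published, so the ~2e25 FLOPs figure below is an outside estimate, not an official number.

```python
import math

# Back-of-the-envelope check of the compute gap discussed above.
gpt3_flops = 3.14e23       # reported in the GPT-3 paper (Brown et al., 2020)
gpt4_flops_est = 2e25      # assumed outside estimate; NOT an official figure

gap_ooms = math.log10(gpt4_flops_est / gpt3_flops)
print(f"GPT-3 is ~{gap_ooms:.1f} OOMs less compute than GPT-4 (est.)")  # ~1.8

# Per the GPT-4 report, the largest non-GPT-4 point in the scaling plot
# used 10,000x less compute, i.e. 4 OOMs below GPT-4.
print(f"largest scaling-law point: {math.log10(10_000):.0f} OOMs below GPT-4")
```

If code-davinci-002 is roughly GPT-3-scale, it would sit around 2 OOMs below GPT-4, well above the 4-OOM largest point, which seems to be this comment's argument against YES.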

Related questions

Will GPT-4 be trained (roughly) compute-optimally using the best-known scaling laws at the time? (30% chance)
What hardware will GPT-5 be trained on?
What will be true about GPT-5?
Will the performance jump from GPT4->GPT5 be less than the one from GPT3->GPT4? (71% chance)
Will GPT-5 have over 1 trillion parameters? (87% chance)
GPT-4 #5: Will GPT-4 be a dense model? (1% chance)
GPT-5 trained with >=24k GPUs? (82% chance)
How many parameters does GPT4o have?
Is GPT-4.5 the base model for o3? (4% chance)
Will GPT-5 have more than 10 trillion parameters? (31% chance)
