manitcor@lemmy.intai.tech to

ChatGPT@lemmy.worldEnglish · 1 year ago

GPT-4's details are leaked.

threadreaderapp.com

144

GPT-4's details are leaked.

threadreaderapp.com

manitcor@lemmy.intai.tech to

ChatGPT@lemmy.worldEnglish · 1 year ago

Thread by @Yampeleg on Thread Reader App

threadreaderapp.com

@Yampeleg: GPT-4's details are leaked. It is over. Everything is here: twitter.com/i/web/status/1… Parameters count: GPT-4 is more than 10x the size of GPT-3. We believe it has a total of ~1.8 trillion parameters ac...…

cross-posted from: https://lemmy.intai.tech/post/72919

Parameters count:

GPT-4 is more than 10x the size of GPT-3. We believe it has a total of ~1.8 trillion parameters across 120 layers. Mixture Of Experts - Confirmed.

OpenAI was able to keep costs reasonable by utilizing a mixture of experts (MoE) model. They utilizes 16 experts within their model, each is about ~111B parameters for MLP. 2 of these experts are routed to per forward pass.

Related Article: https://lemmy.intai.tech/post/72922

Chat

Maple@lemmy.world
link
fedilink
English
arrow-up
14·
edit-2
1 year ago
“Half of those additions are censors and more creative ways to say ‘sorry, I can’t do that for you Jim.’” Lol, I’m just kidding, 1.8t parameters is incredible.

I just really hope that it’s not as censored as it currently is. ;_;