Tweeted By @gwern
GPT-3 is terrifying because it's a tiny model compared to what's possible, trained in the dumbest way possible on a single impoverished modality on tiny data, yet the first version already manifests crazy runtime meta-learningโand the scaling curves ๐ด๐ต๐ช๐ญ๐ญ are not bending! ๐ฎ https://t.co/hQbW9znm3x
โ ๐๐ด๐ข๐ฏ๐ซ (@gwern) May 31, 2020