Tweeted By @Tim_Dettmers
I was made aware of two papers that are similar and preceded both OpenAI papers. I think these add more data points to scaling behavior for language (and also vision). These should be shared more widely! https://t.co/ZCXYCt3DgN https://t.co/QNn8KznjXe
— Tim Dettmers (@Tim_Dettmers) November 29, 2020