Tweeted By @hardmaru
Combined Scaling for Zero-shot Transfer Learning
— hardmaru (@hardmaru) November 22, 2021
Another data point for Sutton's “Bitter Lesson”: more data, bigger models, and bigger batch sizes combined lead to big performance gains for zero-shot learning.
New paper by @hieupham789 and others @GoogleAI https://t.co/q97vHUsM45 pic.twitter.com/aXqKb5NaHu