Tweeted By @yaroslavvb
Interesting -- the larger the model, the less data it needs to reach the same validation loss, this is the opposite of what statistics teaches us https://t.co/88qwts7klV
— Yaroslav Bulatov (@yaroslavvb) August 14, 2019