Tweeted By @filippie509
1/3 This is a good study which shows the extent of diminishing returns in deep learning: https://t.co/LiU3ZFfkGE . The largest model has a whopping 829M parameters, gets 3% performance gain over a model with 10x less parameters, with flattening curve. pic.twitter.com/PlgQr1u8fr
— Filip Piekniewski (@filippie509) June 24, 2019