Tweeted by @hardmaru
Deep Double Descent: Where Bigger Models and More Data Hurt
— hardmaru (@hardmaru) December 6, 2019
They conduct an empirical study of the 'double descent' phenomenon in neural networks, investigating this behavior across a range of architectures and its relationship to model size and training time. https://t.co/C9RfiRLlO4
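For intuition about the phenomenon the tweet describes, here is a minimal toy sketch (not the paper's experimental setup): minimum-norm least-squares regression on random ReLU features, sweeping model width through the interpolation threshold (width ≈ number of training samples). All sizes, the noise level, and the feature map are illustrative assumptions; test error should rise to a peak near the threshold and then descend again as width grows.

```python
# Toy double-descent sketch: random ReLU features + min-norm least squares.
# Hyperparameters below are illustrative assumptions, not from the paper.
import numpy as np

rng = np.random.default_rng(0)
n_train, n_test, d = 100, 1000, 20

# Noisy linear teacher: y = x . w* + noise on the training labels only.
w_star = rng.normal(size=d)
X_train = rng.normal(size=(n_train, d))
X_test = rng.normal(size=(n_test, d))
y_train = X_train @ w_star + 0.5 * rng.normal(size=n_train)
y_test = X_test @ w_star

def relu_features(X, W):
    """Random ReLU features: phi(x) = max(0, x W)."""
    return np.maximum(X @ W, 0.0)

for width in [10, 50, 90, 100, 110, 200, 500, 1000]:
    W = rng.normal(size=(d, width)) / np.sqrt(d)
    Phi_train = relu_features(X_train, W)
    Phi_test = relu_features(X_test, W)
    # pinv yields the minimum-norm solution, which interpolates the
    # training data once width exceeds n_train.
    beta = np.linalg.pinv(Phi_train) @ y_train
    mse = np.mean((Phi_test @ beta - y_test) ** 2)
    print(f"width={width:5d}  test MSE={mse:8.3f}")
```

Printing test MSE across widths should show the characteristic shape: error improves, spikes near width = n_train = 100, then descends again in the overparameterized regime.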