Tweeted By @dennybritz
Dropout has been a standard technique for years, but we’re still finding new ways to interpret it.
— Denny Britz (@dennybritz) July 2, 2018
I tend to think of new techniques/architectures as an advancement in either regularization or optimization/gradient flow, but sometimes it’s difficult to tell which one it is. https://t.co/74lt0VtExG