Tweeted By @RogerGrosse
"We define Internal Covariate Shift as the change in the
distribution of network activations due to the change in
network parameters during training." -batch norm paper
Those who cite ICS as a nebulous concept, can you give a plausible interpretation besides the obvious one?
— Roger Grosse (@RogerGrosse) February 24, 2019