Tweeted By @fchollet
Gradient descent will take any shortcut available to map inputs to targets. Human perception works differently: it starts from a different input (embodied stream vs static images), it doesn't have a target for each input, and it isn't trained with SGD. https://t.co/coj5YmMwB3
— François Chollet (@fchollet) July 1, 2019