Tweeted By @ch402
(3) The other interesting result is that you can create a different dataset of adversarial attacks, where you try to predict the attack class.
— Chris Olah (@ch402) May 9, 2019
They find this model - trained on adversarial attacks - generalizes to clean data, which I probably wouldn’t have predicted in advance. pic.twitter.com/ood8JL1dVy