Tweeted By @ml_review
Pervasive Attention: 2D Convolutional Neural Networks for Sequence-to-Sequence Prediction
— ML Review (@ml_review) August 21, 2018
Outperforms SoTA encoder-decoder systems, while being conceptually simpler and having fewer parameters.
Githubhttps://t.co/QlSPJJZOFy
ArXivhttps://t.co/JjWpv9saG8 pic.twitter.com/GfTTNVZ54b