Attention Is All You Need, but CNNs and RNNs help.
— hardmaru (@hardmaru) August 20, 2018
CNN: https://t.co/b7UJ5bEDWu
RNN: https://t.co/m0YUVX1ySM
Attention Is All You Need, but CNNs and RNNs help.
— hardmaru (@hardmaru) August 20, 2018
CNN: https://t.co/b7UJ5bEDWu
RNN: https://t.co/m0YUVX1ySM
Pervasive Attention: 2D Convolutional Neural Networks for Sequence-to-Sequence Prediction
— ML Review (@ml_review) August 21, 2018
Outperforms SoTA encoder-decoder systems, while being conceptually simpler and having fewer parameters.
Githubhttps://t.co/QlSPJJZOFy
ArXivhttps://t.co/JjWpv9saG8 pic.twitter.com/GfTTNVZ54b