Tweeted By @_akhaliq
N-Grammer: Augmenting Transformers with latent n-grams
— AK (@_akhaliq) July 14, 2022
abs: https://t.co/Rx9wzbjoHj
propose modification to the Transformer architecture by augmenting the model with n-grams that are constructed from a discrete latent representation of the text sequence pic.twitter.com/RzrRBcVGR9