Tweeted by @hardmaru
Transformer-XL: Combining Transformers and RNNs Into a State-of-the-art Language Model
— hardmaru (@hardmaru) January 17, 2019
Blog post by @HorevRani giving an overview of the model and key concepts such as the recurrence mechanism and the relative positional encoding scheme. https://t.co/ORv18GkZBv https://t.co/l1OJKvUNyc