Tweeted By @evolvingstuff
REALLY cool improvement upon Transformer networks that makes use of recurrence and relative positional encodings! Up to 1800x faster!
Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context https://t.co/eV4iy1kOPT
TensorFlow & PyTorch: https://t.co/MqZAZKlhEn pic.twitter.com/TjrjOomYnb
— Thomas Lahore (@evolvingstuff) January 10, 2019
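The tweet names the two ingredients (segment-level recurrence and relative positional encodings) without showing how they fit together. Below is a minimal, hypothetical PyTorch sketch of the recurrence idea only: hidden states from the previous segment are cached, detached from the gradient graph, and prepended as extra context for the current segment. All names, dimensions, and the single-head/single-layer setup are illustrative assumptions, not the authors' implementation, which also handles relative positional encodings, multiple heads, and stacked layers.

```python
import torch

def attend_with_memory(q_proj, k_proj, v_proj, segment, memory):
    """Single-head attention over [memory ; segment] (illustrative sketch)."""
    context = torch.cat([memory, segment], dim=0)   # (mem_len + seg_len, d_model)
    q = q_proj(segment)                             # queries come only from the new segment
    k, v = k_proj(context), v_proj(context)         # keys/values also see the cached context
    scores = q @ k.T / k.shape[-1] ** 0.5
    return torch.softmax(scores, dim=-1) @ v

d_model, seg_len, mem_len = 32, 8, 8
q_proj = torch.nn.Linear(d_model, d_model)
k_proj = torch.nn.Linear(d_model, d_model)
v_proj = torch.nn.Linear(d_model, d_model)

memory = torch.zeros(mem_len, d_model)              # empty cache before the first segment
for _ in range(3):                                  # a stream of consecutive segments
    segment = torch.randn(seg_len, d_model)         # stand-in for embedded tokens
    out = attend_with_memory(q_proj, k_proj, v_proj, segment, memory)
    # Cache this segment's states for the next one; detach() stops gradients
    # from flowing across segment boundaries, as in the paper.
    memory = segment.detach()[-mem_len:]
```

Reusing the cache this way is also where the headline speedup comes from: at evaluation time, previous-segment states are read from memory instead of being recomputed for every new prediction.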