Tweeted By @ml_review
"LSTM is dead. Long Live Transformers!" by @leopd
— ML Review (@ml_review) August 23, 2020
Clear & brief 20 min technical deep dive into how Transformers work with multi-headed self-attention and positional encoding https://t.co/FvyA8m41kg