Compared to RNNs, the Transformer family of architectures seemingly scale to hundreds of millions of parameters with relative ease. pic.twitter.com/Xo7Ng5b2Gj
— hardmaru (@hardmaru) January 11, 2019