Tweeted by @marian_nmt
Preparing the GPT-2 paper for a reading group. Seems to me the biggest danger is "destructive pre-processing" (love that term). NLP people, stop distributing oddly tokenized, shuffled, or otherwise mangled resources. This is the scourge of NLP. #NLProc
— Marian NMT (@marian_nmt) February 26, 2019