Tweeted By @srush_nlp
New: Cascaded Text Generation with Markov Transformers (https://t.co/KUQ4tAeH0n, Yuntian Deng)
— Sasha Rush (@srush_nlp) June 2, 2020
Beam Search Translation : Serial but fluent.
Non-Autoregresssive (NAT): Parallel but disfluent (and kind of hacky...)
Why not parallel, fast, autoregressive, and accurate?
/thread pic.twitter.com/sOngb5rNyT