Tweeted By @hardmaru
Training Language GANs from Scratch
— hardmaru (@hardmaru) May 27, 2019
Latest work that attempts to train a language model entirely using GAN discriminator's loss function rather than maximum likelihood loss. Large population batch size and dense reward signal make REINFORCE less unhappy. https://t.co/KeEMtBjCFq pic.twitter.com/7058pHP1nU