Tweeted By @Thom_Wolf
If you want a sneak peek at @YejinChoinka, @rown and co-workers' work on GROVER (a 1.5 billion param GPT-2-like model), check this live tweet 👇
— Thomas Wolf (@Thom_Wolf) June 6, 2019
Interesting hints, results, and analysis!
Paper: https://t.co/aJJrrmQjJc
Demo: https://t.co/MDfzlYa1iE https://t.co/jjwQEUU0tI