Tweeted By @PyTorch
FairSeq Toolkit - Major Update
— PyTorch (@PyTorch) June 16, 2018
- Distributed Training
- Transformer models (big Transformer on WMT Eng-German in < 5 hours on DGX-1)
- Fast Inference: translations @ 92 sent/sec for big Transformer
- Story Generation
Read more at Michael Auli's post: https://t.co/eptKDuh0WI pic.twitter.com/d4OtJZpdFw