Tweeted By @seb_ruder
Code and pretrained weights for BERT are out now.
— Sebastian Ruder (@seb_ruder) October 31, 2018
Includes scripts to reproduce results. BERT-Base can be fine-tuned on a standard GPU; for BERT-Large, a Cloud TPU is required (as max batch size for 12-16 GB is too small).https://t.co/CWv8GMZiX5