Tweeted By @DeepMind
We helped BERT do better on several structured prediction tasks by learning from a language model that knows more about syntax, showcasing the benefits of structural biases in large-scale models.
— DeepMind (@DeepMind) May 29, 2020
Read about it here: https://t.co/zm418xTGhp