XTREME-R: Towards More Challenging and Nuanced Multilingual Evaluation
— Sebastian Ruder (@seb_ruder) April 16, 2021
We examine the state of multilingual benchmarking and propose an improved benchmark covering more challenging tasks, including a diagnostic and evaluation suite to inform future work.https://t.co/QCppOeNrV4 pic.twitter.com/pR8FRauZH6