Tweeted By @ai2_allennlp
We just released the multilingual C4 dataset! 1.4TB of German? 1.6TB of Spanish? 3.6TB of Russian 😲? We got it all. Check out the announcement at https://t.co/T6wcPDl5O7!
— AllenNLP (@ai2_allennlp) June 16, 2021
We just released the multilingual C4 dataset! 1.4TB of German? 1.6TB of Spanish? 3.6TB of Russian 😲? We got it all. Check out the announcement at https://t.co/T6wcPDl5O7!
— AllenNLP (@ai2_allennlp) June 16, 2021