Tweeted By @ak92501
WIT: Wikipedia-based Image Text Dataset for Multimodal
— AK (@ak92501) March 3, 2021
Multilingual Machine Learning
pdf: https://t.co/fblyzH2hGe
abs: https://t.co/tVgBdfOnQ5
github: https://t.co/NNkF3oheok pic.twitter.com/nnFUaPJaYU