Tweeted By @ak92501
A Thousand Words Are Worth More Than a Picture:
— AK (@ak92501) January 17, 2022
Natural Language-Centric Outside-Knowledge Visual Question Answering
abs: https://t.co/pmciI9NUSg
TRiG framework outperforms all sota supervised methods by at least 11.1% absolute margin pic.twitter.com/CZkWbL4CPO