Tweeted By @_akhaliq

on 2022-05-30 (UTC)
research cv nlp

GIT: A Generative Image-to-text Transformer for Vision and Language
abs: https://t.co/iFly0pcoXM

model surpasses the human performance for the first time on TextCaps (138.2 vs. 125.5 in CIDEr) pic.twitter.com/vn9LV98Dwr
— AK (@_akhaliq) May 30, 2022

Tweeted By @_akhaliq

Tags