Tweeted By @ak92501
When Vision Transformers Outperform ResNets without Pretraining or Strong Data Augmentations
— AK (@ak92501) June 4, 2021
pdf: https://t.co/GYknaVoNAM
abs: https://t.co/kaUxIdMVNQ
+5.3% and +11.0% top-1 accuracy on ImageNet for ViT-B/16 and Mixer-B/16, with simple Inception-style preprocessing