Tweeted By @jeremyphoward

on 2022-02-19 (UTC)
Tags: research, cv

How Do Vision Transformers Work?
"...we propose AlterNet, a model in which Conv blocks at the end of a stage are replaced with MSA blocks. AlterNet outperforms CNNs not only in large data regimes but also in small data regimes." https://t.co/edPXnu0cn8
— Jeremy Howard (@jeremyphoward) February 19, 2022
