Tweeted by @martin_gorner
This looks like the Vision Transformers architecture we have been waiting for: MaxViT https://t.co/WbzgJ50PjB
— Martin Görner (@martin_gorner) October 11, 2022
1/ State of the Art accuracy on ImageNet (no pre-training on huge datasets)
2/ Linear complexity w.r.t. image size (thanks to a clever attention design)
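The linear complexity claimed in point 2 comes from MaxViT's decomposed attention: instead of full self-attention over all pixels (quadratic in image size), it attends within fixed-size local windows ("block" attention) and across a fixed-size sparse grid ("grid" attention). Below is a minimal NumPy sketch of that idea, assuming square inputs and omitting the learned Q/K/V projections, multi-head logic, and MBConv blocks of the real architecture; the function names are my own, not the paper's.

```python
import numpy as np

def window_partition(x, P):
    """Split an (H, W, C) feature map into non-overlapping P x P windows.
    Returns (num_windows, P*P, C): attention within each window is local."""
    H, W, C = x.shape
    x = x.reshape(H // P, P, W // P, P, C)
    return x.transpose(0, 2, 1, 3, 4).reshape(-1, P * P, C)

def grid_partition(x, G):
    """Partition into a fixed G x G grid: each group gathers one token per
    grid cell, giving sparse, dilated *global* mixing at the same cost."""
    H, W, C = x.shape
    x = x.reshape(G, H // G, G, W // G, C)
    return x.transpose(1, 3, 0, 2, 4).reshape(-1, G * G, C)

def attention(tokens):
    """Plain softmax self-attention inside each group (no projections,
    for brevity). tokens: (num_groups, N, C) with N fixed (P*P or G*G)."""
    scores = tokens @ tokens.transpose(0, 2, 1) / np.sqrt(tokens.shape[-1])
    weights = np.exp(scores - scores.max(-1, keepdims=True))
    weights /= weights.sum(-1, keepdims=True)
    return weights @ tokens
```

Because each group has a fixed token count N (P² or G²), attention costs O(N²·C) per group, and the number of groups grows as H·W/N, so the total cost is O(H·W·N·C): linear in image size, unlike the O((H·W)²·C) of full self-attention.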