Tweeted By @ak92501
Swin Transformer V2: Scaling Up Capacity and Resolution
— AK (@ak92501) November 19, 2021
abs: https://t.co/vBm66uyUBZ
Scaling Swin Transformer up to 3 billion parameters and making it capable of training with images of up to 1,536×1,536 resolution sets new records on four representative vision benchmarks. pic.twitter.com/2lvEvCOZ35