Tweeted By @ak92501
CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped Windows
— AK (@ak92501) July 2, 2021
pdf: https://t.co/6KuG5MRGPM
85.4% Top-1 accuracy on ImageNet-1K without any extra training data or label, 53.9 box AP and 46.4 mask AP on the COCO detection task pic.twitter.com/pHZdSI0RBa