Tweeted By @ak92501

on 2021-07-02 (UTC)
cv research

CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped Windows
pdf: https://t.co/6KuG5MRGPM

85.4% Top-1 accuracy on ImageNet-1K without any extra training data or label, 53.9 box AP and 46.4 mask AP on the COCO detection task pic.twitter.com/pHZdSI0RBa
— AK (@ak92501) July 2, 2021
