Vision Permutator: A Permutable MLP-Like Architecture for Visual Recognition
— AK (@ak92501) June 24, 2021
pdf: https://t.co/2Kb6zoFwR1
github: https://t.co/Ahe17BEDwf
achieves 81.5% top-1 accuracy on ImageNet without extra large-scale training data (e.g., ImageNet-22k) using only 25M learnable parameters pic.twitter.com/KO1Kx7OnSi