MaxViT : combines ConvNet modules and 2 types of self attention (local n'y block, and on a subsampled grid).
— Yann LeCun (@ylecun) October 10, 2022
Since DETR (hi @alcinos26 !), I've become convinced that combining Conv and attention/dynamic routing was the Right Thing. https://t.co/DNOBsqL54Z