Tweeted By @ak92501
ViDT: An Efficient and Effective Fully Transformer-based Object Detector
— AK (@ak92501) October 11, 2021
abs: https://t.co/rOytM75swG
ViDT obtains the best AP and latency trade-off among existing fully transformer-based object detectors, and achieves 49.2 AP owing to its high scalability for large models pic.twitter.com/CVAzoT3dNh