Tweeted By @ak92501
Revisiting 3D ResNets for Video Recognition
— AK (@ak92501) September 7, 2021
pdf: https://t.co/UwhpllMj6z
abs: https://t.co/LofJjbdq2P
When pre-trained on a large Web Video Text dataset, our best model achieves 83.5% and 84.3% on Kinetics-400 and Kinetics-600 pic.twitter.com/F7Wmj2yW6X