Tweeted By @ak92501
DiT: Self-supervised Pre-training for Document Image Transformer
— AK (@ak92501) March 7, 2022
abs: https://t.co/OUQ94iQ6dY
achieves sota results on downstream tasks, e.g. document image classification (91.11 → 92.69), document layout analysis (91.0 → 94.9) and table detection (94.23 → 96.55) pic.twitter.com/uZWAMGh71s