Incidents1M: a large-scale dataset of images with natural disasters, damage, and incidents
— AK (@ak92501) January 13, 2022
abs: https://t.co/ehdybJKTr5
project page: https://t.co/HnfvJnYcSM pic.twitter.com/6Q2vHRfCZm
QuadTree Attention for Vision Transformers
— AK (@ak92501) January 11, 2022
abs: https://t.co/ImFIQAQsZn
4.0% improvement in feature matching on ScanNet, about 50% flops reduction in stereo matching, 0.4-1.5% improvement in top-1 accuracy on ImageNet classification, 1.2-1.8% improvement on COCO object detection pic.twitter.com/0yxa0VnMCO
Impressive results via data distillation (i.e., reducing a large dataset to a smaller, synthetic one). Here, the researchers represent the 50k images in CIFAR-10 via just 10 images. A model trained on these 10 images achieves 64% accuracy on the original test set https://t.co/MZHHSaDcK5 pic.twitter.com/3LRn4xAOse
— Sebastian Raschka (@rasbt) December 29, 2021
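The distillation idea above can be sketched in miniature: learn a handful of synthetic points so that a model trained on them alone fits a much larger "real" dataset. This is a toy bilevel setup with a linear model and finite-difference gradients, purely illustrative, not the paper's actual method; every name and hyperparameter here is an assumption.

```python
# Toy dataset-distillation sketch (illustrative only, NOT the paper's method):
# distill 200 real points into 2 synthetic ones such that a linear model
# trained on the synthetic data (one inner gradient step) fits the real data.
import numpy as np

rng = np.random.default_rng(0)

# "Real" dataset: 200 points generated by a known linear rule.
n_real, dim = 200, 3
w_true = np.array([1.5, -2.0, 0.5])
X_real = rng.normal(size=(n_real, dim))
y_real = X_real @ w_true

n_syn, inner_lr = 2, 0.5  # distill down to 2 synthetic (x, y) pairs

def train_on_synthetic(params):
    """Inner loop: one gradient step from w = 0 on the synthetic data."""
    X_syn = params[: n_syn * dim].reshape(n_syn, dim)
    y_syn = params[n_syn * dim :]
    grad = X_syn.T @ (X_syn @ np.zeros(dim) - y_syn) / n_syn
    return -inner_lr * grad  # w = 0 - lr * grad

def outer_loss(params):
    """Outer objective: how well the synthetically-trained model fits real data."""
    w = train_on_synthetic(params)
    return np.mean((X_real @ w - y_real) ** 2)

# Optimize the synthetic points themselves via finite-difference gradient descent.
params = rng.normal(size=n_syn * dim + n_syn)
eps, outer_lr = 1e-5, 0.05
for _ in range(2000):
    grad = np.zeros_like(params)
    for i in range(params.size):
        bump = np.zeros_like(params)
        bump[i] = eps
        grad[i] = (outer_loss(params + bump) - outer_loss(params - bump)) / (2 * eps)
    params -= outer_lr * grad

print(f"loss on real data after distillation: {outer_loss(params):.4f}")
```

The real work uses images and deep networks, where the inner training loop and the outer gradient (through that loop) are far more involved, but the bilevel structure is the same.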
Vision Transformer for Small-Size Datasets
— AK (@ak92501) December 28, 2021
abs: https://t.co/uy4e4Mo44m
when SPT and LSA were applied to ViTs, performance improved by an average of 2.96% on Tiny-ImageNet, which is a representative small-size dataset pic.twitter.com/QxZUoqJGCQ
Part 2: Summary of 10 summaries on:
— AI Fast Track (60/60) (@ai_fast_track) December 6, 2021
Tips & Tricks & Best Practices in training (not only) object detection models.
Don't miss any of those posts, follow @ai_fast_track to catch them in your feed.
Summary of summaries: ... pic.twitter.com/VLcWNkMaph
.@Gradio Demo for Pyxelate: convert images to pixel art now on @huggingface Spaces
— AK (@ak92501) December 4, 2021
demo: https://t.co/T8cBs8lk0o
github: https://t.co/pdWuidyvaM pic.twitter.com/s0PM9KHK63
BEVT: BERT Pretraining of Video Transformers
— AK (@ak92501) December 3, 2021
abs: https://t.co/6BI5E3f9Cv pic.twitter.com/tV5ASUKHMd
AdaViT: Adaptive Vision Transformers for Efficient Image Recognition
— AK (@ak92501) December 1, 2021
abs: https://t.co/JkGgzi64CW
in experiments on ImageNet, the method obtains more than 2× improvement in efficiency compared to sota vision transformers with only a 0.8% drop in accuracy pic.twitter.com/wIcMjPA72X
Donut 🍩: Document Understanding Transformer without OCR
— AK (@ak92501) December 1, 2021
abs: https://t.co/A644UXgUuG
achieves sota performance on various document understanding tasks in public benchmark datasets and private industrial service datasets pic.twitter.com/2broMiK9r5
Pyramid Adversarial Training Improves ViT Performance
— AK (@ak92501) December 1, 2021
abs: https://t.co/oaxB6Q99R2
new sota for ImageNet-C (41.4 mCE), ImageNet-R (53.92%), and ImageNet-Sketch (41.04%) without extra data, using only the ViT-B/16 backbone and pyramid adversarial training pic.twitter.com/afdSG0J35i
Sparse DETR: Efficient End-to-End Object Detection with Learnable Sparsity
— AK (@ak92501) November 30, 2021
abs: https://t.co/U4I6gnNvej
Sparse DETR achieves better performance than Deformable DETR even with only 10% encoder tokens on the COCO dataset pic.twitter.com/gcERMMxUu4
Self-slimmed Vision Transformer
— AK (@ak92501) November 25, 2021
abs: https://t.co/BHQIZVWZlN pic.twitter.com/ToB9DODFwU