.@Gradio Demo for YOLOv5 Det v0.4 on @huggingface Spaces
— AK (@ak92501) May 28, 2022
demo: https://t.co/P5n6kg8ZZG
Join the Blocks Party: https://t.co/JnhdF6acqX pic.twitter.com/AU7WXa0QAn
.@Gradio Demo for YOLOv5 Det v0.4 on @huggingface Spaces
— AK (@ak92501) May 28, 2022
demo: https://t.co/P5n6kg8ZZG
Join the Blocks Party: https://t.co/JnhdF6acqX pic.twitter.com/AU7WXa0QAn
AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition
— AK (@ak92501) May 27, 2022
abs: https://t.co/VHJYoC4ewJ
project page: https://t.co/goVCa1VXbi
github: https://t.co/AXYYJr2vcd pic.twitter.com/d1gQwwgVMw
New blog post about the magic of diffusion guidance!https://t.co/BITNC4nMLM
— Sander Dieleman (@sedielem) May 26, 2022
Guidance powers the recent spectacular results in text-conditioned image generation (DALL·E 2, Imagen), so the time is right for a closer look at this simple, yet extremely effective technique.
Inception Transformer
— AK (@ak92501) May 26, 2022
abs: https://t.co/EoPDBOafSS
iFormer-S hits the top-1 accuracy of 83.4% on ImageNet-1K, much higher than DeiT-S by 3.6%, and even slightly better than much bigger model Swin-B (83.3%) with only 1/4 parameters and 1/3 FLOPs pic.twitter.com/TdtFJfW7w1
Pretraining is All You Need for Image-to-Image Translation
— AK (@ak92501) May 26, 2022
abs: https://t.co/AafrOKGSak
project page: https://t.co/jLrY13lF0N pic.twitter.com/w5fStjw1mm
RTMV: A Ray-Traced Multi-View Synthetic Dataset for Novel View Synthesis
— AK (@ak92501) May 17, 2022
abs: https://t.co/4kdIL9g29r pic.twitter.com/JmXpCqFPXx
Simple Open-Vocabulary Object Detection with Vision Transformers
— AK (@ak92501) May 13, 2022
abs: https://t.co/ytb2Tvliu1 pic.twitter.com/0xUjokjLcB
.@Gradio Demo for CaptchaCracker, an open source Python library that provides functions to create and apply deep learning models for Captcha Image recognition on @huggingface Spaces
— AK (@ak92501) May 9, 2022
demo: https://t.co/jXbFUsSgqx
github: https://t.co/tEFCEB43uM pic.twitter.com/MsUisP4wBd
Google Scanned Objects: A High-Quality Dataset of 3D Scanned Household Items
— AK (@ak92501) April 27, 2022
abs: https://t.co/eVRIkbLeqp pic.twitter.com/jRDQEYfYnV
Understanding The Robustness in Vision Transformers
— AK (@ak92501) April 27, 2022
abs: https://t.co/reQ35twd49
model achieves a state-of-the-art 87.1% accuracy and 35.8% mCE on ImageNet-1k and ImageNet-C with 76.8M parameters pic.twitter.com/H5pTsUpEE0
High Quality Segmentation for Ultra High-resolution Images
— AK (@ak92501) April 25, 2022
abs: https://t.co/ljV7jl3olc
github: https://t.co/57fX0G15IM pic.twitter.com/SEZAHEQe8K
A Tour of Visualization Techniques for Computer Vision Datasets
— AK (@ak92501) April 20, 2022
abs: https://t.co/N0j9jlMZcC pic.twitter.com/bKzgtpj616