Tag - cv

by ak92501 on 2022-05-28 (UTC).

.@Gradio Demo for YOLOv5 Det v0.4 on @huggingface Spaces
demo: https://t.co/P5n6kg8ZZG
Join the Blocks Party: https://t.co/JnhdF6acqX pic.twitter.com/AU7WXa0QAn
— AK (@ak92501) May 28, 2022

cv application

by ak92501 on 2022-05-27 (UTC).

AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition
abs: https://t.co/VHJYoC4ewJ
project page: https://t.co/goVCa1VXbi
github: https://t.co/AXYYJr2vcd pic.twitter.com/d1gQwwgVMw
— AK (@ak92501) May 27, 2022

research w_code cv

by sedielem on 2022-05-26 (UTC).

New blog post about the magic of diffusion guidance!https://t.co/BITNC4nMLM

Guidance powers the recent spectacular results in text-conditioned image generation (DALL·E 2, Imagen), so the time is right for a closer look at this simple, yet extremely effective technique.
— Sander Dieleman (@sedielem) May 26, 2022

learning tutorial cv nlp

by ak92501 on 2022-05-26 (UTC).

Inception Transformer
abs: https://t.co/EoPDBOafSS

iFormer-S hits the top-1 accuracy of 83.4% on ImageNet-1K, much higher than DeiT-S by 3.6%, and even slightly better than much bigger model Swin-B (83.3%) with only 1/4 parameters and 1/3 FLOPs pic.twitter.com/TdtFJfW7w1
— AK (@ak92501) May 26, 2022

cv

by ak92501 on 2022-05-26 (UTC).

Pretraining is All You Need for Image-to-Image Translation
abs: https://t.co/AafrOKGSak
project page: https://t.co/jLrY13lF0N pic.twitter.com/w5fStjw1mm
— AK (@ak92501) May 26, 2022

cv research

by ak92501 on 2022-05-17 (UTC).

RTMV: A Ray-Traced Multi-View Synthetic Dataset for Novel View Synthesis
abs: https://t.co/4kdIL9g29r pic.twitter.com/JmXpCqFPXx
— AK (@ak92501) May 17, 2022

research cv

by ak92501 on 2022-05-13 (UTC).

Simple Open-Vocabulary Object Detection with Vision Transformers
abs: https://t.co/ytb2Tvliu1 pic.twitter.com/0xUjokjLcB
— AK (@ak92501) May 13, 2022

research cv

by ak92501 on 2022-05-09 (UTC).

.@Gradio Demo for CaptchaCracker, an open source Python library that provides functions to create and apply deep learning models for Captcha Image recognition on @huggingface Spaces
demo: https://t.co/jXbFUsSgqx
github: https://t.co/tEFCEB43uM pic.twitter.com/MsUisP4wBd
— AK (@ak92501) May 9, 2022

application w_code cv

by ak92501 on 2022-04-27 (UTC).

Google Scanned Objects: A High-Quality Dataset of 3D Scanned Household Items
abs: https://t.co/eVRIkbLeqp pic.twitter.com/jRDQEYfYnV
— AK (@ak92501) April 27, 2022

dataviz cv

by ak92501 on 2022-04-27 (UTC).

Understanding The Robustness in Vision Transformers
abs: https://t.co/reQ35twd49

model achieves a state-of-the-art 87.1% accuracy and 35.8% mCE on ImageNet-1k and ImageNet-C with 76.8M parameters pic.twitter.com/H5pTsUpEE0
— AK (@ak92501) April 27, 2022

research cv

by ak92501 on 2022-04-25 (UTC).

High Quality Segmentation for Ultra High-resolution Images
abs: https://t.co/ljV7jl3olc
github: https://t.co/57fX0G15IM pic.twitter.com/SEZAHEQe8K
— AK (@ak92501) April 25, 2022

research w_code cv

by ak92501 on 2022-04-20 (UTC).

A Tour of Visualization Techniques for Computer Vision Datasets
abs: https://t.co/N0j9jlMZcC pic.twitter.com/bKzgtpj616
— AK (@ak92501) April 20, 2022

dataviz cv survey

Tag: cv

Tags