ALADIN: All Layer Adaptive Instance Normalization for Fine-grained Style Similarity
— AK (@ak92501) March 18, 2021
pdf: https://t.co/K7homQW2GM
abs: https://t.co/jTp3DS4u7o pic.twitter.com/NUBfHbZZWE
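The "adaptive instance normalization" (AdaIN) in the title refers to the well-known operation of re-normalizing content features with style statistics. A minimal NumPy sketch of that operation (the tensor shapes and test data here are illustrative assumptions, not details from the paper):

```python
import numpy as np

def adain(content, style, eps=1e-5):
    # content, style: (C, H, W) feature maps.
    # Per-channel mean/std over the spatial dimensions.
    c_mu = content.mean(axis=(1, 2), keepdims=True)
    c_sd = content.std(axis=(1, 2), keepdims=True)
    s_mu = style.mean(axis=(1, 2), keepdims=True)
    s_sd = style.std(axis=(1, 2), keepdims=True)
    # Normalize the content features, then re-scale and
    # shift them with the style statistics.
    return s_sd * (content - c_mu) / (c_sd + eps) + s_mu

# Illustrative feature maps (3 channels, 8x8 spatial grid).
rng = np.random.default_rng(0)
c = rng.normal(0.0, 1.0, (3, 8, 8))
s = rng.normal(2.0, 0.5, (3, 8, 8))
out = adain(c, s)
```

After the call, each channel of `out` carries the content's spatial pattern but the style's per-channel mean and standard deviation.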
Is it Enough to Optimize CNN Architectures on ImageNet?
— AK (@ak92501) March 17, 2021
pdf: https://t.co/zC5jToLTto
abs: https://t.co/oIYWstLrIf pic.twitter.com/RrEHLNJEqa
Revisiting ResNets: Improved Training and Scaling Strategies
— AK (@ak92501) March 16, 2021
pdf: https://t.co/Pn5cU2SVkB
abs: https://t.co/icpnuFwmXU pic.twitter.com/bA0E1GWR5z
Facebook AI has built TimeSformer, a new architecture for video understanding. It’s the first based exclusively on the self-attention mechanism used in Transformers. It outperforms the state of the art while being more efficient than 3D ConvNets for video. https://t.co/8mQ2rMgcDo pic.twitter.com/dBpbT3UJRx
— Facebook AI (@facebookai) March 15, 2021
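A minimal sketch of what "exclusively self-attention over video" can look like: patch tokens attend first across time, then across space within each frame. This toy NumPy version (single head, shared weights for both steps, no residuals, LayerNorm, or MLP) is an illustrative assumption, not TimeSformer's actual implementation:

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def self_attention(x, Wq, Wk, Wv):
    # x: (tokens, dim) -> single-head scaled dot-product attention.
    q, k, v = x @ Wq, x @ Wk, x @ Wv
    scores = softmax(q @ k.T / np.sqrt(k.shape[-1]))
    return scores @ v

# Toy video clip as patch embeddings: T frames, N patches/frame, dim D.
T, N, D = 4, 9, 8
rng = np.random.default_rng(0)
x = rng.normal(size=(T, N, D))
Wq, Wk, Wv = (rng.normal(size=(D, D)) * 0.1 for _ in range(3))

# Temporal step: each spatial position attends across the T frames.
xt = x.transpose(1, 0, 2)                                  # (N, T, D)
xt = np.stack([self_attention(t, Wq, Wk, Wv) for t in xt])
x = xt.transpose(1, 0, 2)                                  # (T, N, D)

# Spatial step: each frame's N patches attend to one another.
x = np.stack([self_attention(f, Wq, Wk, Wv) for f in x])
print(x.shape)  # still (T, N, D)
```

Factoring attention into a temporal pass and a spatial pass keeps the cost per token at O(T + N) attended positions rather than O(T·N) for full joint space-time attention.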
We introduce a new approach for image compression: instead of storing the pixels in an image, we store the weights of an MLP overfitted to the image 🌟 At low bit-rates this can do better than JPEG! https://t.co/ATIyOEiwNX
with @adam_golinski @notmilad @yeewhye @ArnaudDoucet1 pic.twitter.com/5sVBc2oST5
— Emilien Dupont (@emidup) March 10, 2021
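A minimal sketch of the idea: fit a small MLP mapping (x, y) pixel coordinates to intensities, so the "compressed file" is just the trained weights and decoding is a forward pass. The toy image, network sizes, and training loop below are illustrative assumptions, not the paper's setup:

```python
import numpy as np

# Toy 4x4 grayscale "image": a smooth diagonal ramp in [0, 1].
ys, xs = np.mgrid[0:4, 0:4]
image = (xs + ys) / 6.0

# Inputs: normalized (x, y) coordinates; targets: pixel intensities.
coords = np.stack([xs.ravel() / 3.0, ys.ravel() / 3.0], axis=1)  # (16, 2)
targets = image.ravel()[:, None]                                  # (16, 1)

# Tiny 2-layer MLP: 2 -> 16 -> 1 with tanh hidden activation.
rng = np.random.default_rng(0)
W1 = rng.normal(0, 0.3, (2, 16)); b1 = np.zeros(16)
W2 = rng.normal(0, 0.3, (16, 1)); b2 = np.zeros(1)

lr = 0.05
for step in range(5000):
    # Forward pass.
    h = np.tanh(coords @ W1 + b1)
    pred = h @ W2 + b2
    err = pred - targets
    loss = (err ** 2).mean()

    # Backward pass: manual gradients of the MSE loss.
    d_pred = 2 * err / err.size
    dW2 = h.T @ d_pred;            db2 = d_pred.sum(0)
    d_h = d_pred @ W2.T * (1 - h ** 2)
    dW1 = coords.T @ d_h;          db1 = d_h.sum(0)

    W1 -= lr * dW1; b1 -= lr * db1
    W2 -= lr * dW2; b2 -= lr * db2

# Decoding = re-running the MLP on the coordinate grid.
decoded = (np.tanh(coords @ W1 + b1) @ W2 + b2).reshape(4, 4)
print(f"final MSE: {loss:.4f}")
```

The bit-rate is then governed by how many weights the MLP has (and how coarsely they are quantized), independent of the image's pixel count.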
Introducing VISSL (https://t.co/iBEpmCi09R) - a library for reproducible, SOTA self-supervised learning for computer vision! Over 10 methods implemented, 60 pre-trained models, 15 benchmarks, and counting. pic.twitter.com/ZZMd8DpHBD
— PyTorch (@PyTorch) March 9, 2021
First Principles of Computer Vision by Shree Nayar.
In the era of deep learning everything, understanding the fundamentals is more important than ever! https://t.co/wQvdIXC8TM
— Jia-Bin Huang (@jbhuang0604) March 7, 2021
Detectron2Go (D2Go) is a new, state-of-the-art extension for Detectron2 that gives developers an end-to-end pipeline for training and deploying object detection models on mobile devices and hardware. https://t.co/SjtH6PWBQq pic.twitter.com/QeZE4rR74w
— Facebook AI (@facebookai) March 4, 2021
SEER: large-scale SSL for vision.
- pre-train via SSL on 1 billion randomly selected images using SwAV.
- fine-tune on ImageNet: 84.2% top-1 accuracy.
- ft on 10% of ImageNet: 77.9%
- ft on 1% (13 samples per class): 60.5%
- beats SOTA on other CV tasks https://t.co/Q8BT5QvKmf
— Yann LeCun (@ylecun) March 4, 2021
A new blog post I wrote with Ishan Misra.
An overview of Self-Supervised Learning.
We look at recent progress in SSL for vision & explain why SSL is more challenging with high-D continuous signals (images, video) than it is for discrete signals (text). https://t.co/DlL885CPpb
— Yann LeCun (@ylecun) March 4, 2021
WIT: Wikipedia-based Image Text Dataset for Multimodal Multilingual Machine Learning
— AK (@ak92501) March 3, 2021
pdf: https://t.co/fblyzH2hGe
abs: https://t.co/tVgBdfOnQ5
github: https://t.co/NNkF3oheok pic.twitter.com/nnFUaPJaYU
Our newest @kaggle competition is OCR for chemical compounds. Can you apply ML to translate from an image of the chemical structure to the text string that represents it? 4 million chemical structure images to help solve this problem! https://t.co/YpnGWsxczk
— Ben Hamner (@benhamner) March 2, 2021