FewNLU: Benchmarking State-of-the-Art Methods for Few-Shot Natural Language Understanding
pdf: https://t.co/IhdNEnGWo8
abs: https://t.co/YO2hqCTWnk pic.twitter.com/eF7icGRVbA
— AK (@ak92501) September 28, 2021
"torch.manual seed(3407) is all you need: On the influence of random seeds in deep learning architectures for computer vision" https://t.co/LoQhOzpbVw. Results are actually not as bad as the title makes it seem. But yeah, reporting std dev or CIs should be(come) the default. pic.twitter.com/K2JElhgwCs
— Sebastian Raschka (@rasbt) September 27, 2021
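A minimal sketch of the practice the tweet advocates: rerun the same training routine under several seeds and report mean ± standard deviation instead of a single number. The `train_and_evaluate` function below is a hypothetical stand-in for whatever model and dataset you are benchmarking.

```python
import statistics
import torch

def train_and_evaluate(seed: int) -> float:
    """Hypothetical stand-in: train a model under a fixed seed and return test accuracy."""
    torch.manual_seed(seed)          # seed PyTorch's RNG (add torch.cuda.manual_seed_all for GPU runs)
    # ... build the model, train, evaluate ...
    return float(torch.rand(1))      # placeholder result so the sketch runs end to end

# Repeat the experiment under several seeds instead of trusting a single run.
seeds = [0, 1, 2, 3407, 12345]
scores = [train_and_evaluate(s) for s in seeds]

mean = statistics.mean(scores)
std = statistics.stdev(scores)
print(f"accuracy: {mean:.3f} ± {std:.3f} over {len(seeds)} seeds")
```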
Transformers Generalize Linearly
abs: https://t.co/ud0iUEYDyx
Transformers fail to generalize hierarchically across a wide variety of grammatical mapping tasks, but they exhibit an even stronger preference for linear generalization than comparable recurrent networks pic.twitter.com/VzbM2SQTZl
— AK (@ak92501) September 27, 2021
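A toy illustration of what "linear" versus "hierarchical" generalization means, using the classic question-formation task (my own example, not the paper's data): a hierarchical rule fronts the auxiliary of the main clause, while a linear rule fronts the first auxiliary in the string.

```python
def linear_question(tokens):
    """Linear rule: front the first 'is' in the string, a purely positional heuristic."""
    i = tokens.index("is")
    return [tokens[i]] + tokens[:i] + tokens[i + 1:] + ["?"]

def hierarchical_question(tokens, main_aux_index):
    """Hierarchical rule: front the main-clause auxiliary (index supplied by a parse)."""
    return [tokens[main_aux_index]] + tokens[:main_aux_index] + tokens[main_aux_index + 1:] + ["?"]

sent = "the dog that is sleeping is happy".split()
print(" ".join(linear_question(sent)))            # is the dog that sleeping is happy ?  (ungrammatical)
print(" ".join(hierarchical_question(sent, 5)))   # is the dog that is sleeping happy ?  (correct English)
```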
Muzic: Music Understanding and Generation with Artificial Intelligence
github: https://t.co/2XtiUh3h8e pic.twitter.com/4PTPX4qMia
— AK (@ak92501) September 25, 2021
Amusing! Object detection cast naively into language modeling framework + borrowing many of the tips&tricks.
- random object ordering seems fine ✅
- coords, class labels flattened into a single softmax 😂
- sequence augmentation is the most gnarly part, almost as yucky as nms 😬 https://t.co/FxSz5UbpxY
— Andrej Karpathy (@karpathy) September 24, 2021
Have you wondered why object detection, unlike classification, has so many sophisticated algorithms?
With Pix2Seq (https://t.co/ygsG3aAIbG), we simply cast object detection as a language modeling task conditioned on pixels!
(with @srbhsxn, Lala Li, @fleet_dj, @geoffreyhinton) pic.twitter.com/aTYZ5IvJc9
— Ting Chen (@tingchenai) September 23, 2021
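A minimal sketch of the "coords and class labels flattened into a single softmax" idea the two tweets describe: each bounding box becomes five discrete tokens (four quantized coordinates plus a class token) drawn from one shared vocabulary, and boxes are concatenated in a random order. The bin count, vocabulary layout, and function names here are assumptions for illustration, not the paper's exact scheme.

```python
import random

NUM_BINS = 1000          # assumed coordinate quantization resolution
NUM_CLASSES = 80         # e.g. COCO; class tokens sit after the coordinate bins in the vocabulary

def quantize(coord: float) -> int:
    """Map a normalized coordinate in [0, 1] to one of NUM_BINS discrete tokens."""
    return min(int(coord * NUM_BINS), NUM_BINS - 1)

def boxes_to_sequence(boxes):
    """Turn [(ymin, xmin, ymax, xmax, class_id), ...] into one flat token sequence.

    Coordinates and class labels all live in a single vocabulary of size
    NUM_BINS + NUM_CLASSES, so a decoder can predict them with one softmax.
    """
    boxes = list(boxes)
    random.shuffle(boxes)                      # random object ordering
    tokens = []
    for ymin, xmin, ymax, xmax, class_id in boxes:
        tokens += [quantize(ymin), quantize(xmin), quantize(ymax), quantize(xmax)]
        tokens.append(NUM_BINS + class_id)     # class token offset past the coordinate bins
    return tokens

# Two normalized boxes: (ymin, xmin, ymax, xmax, class_id)
print(boxes_to_sequence([(0.1, 0.2, 0.5, 0.6, 3), (0.0, 0.0, 1.0, 1.0, 17)]))
```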
Neural Distance Embeddings for Biological Sequences
abs: https://t.co/fvlN4fcGpB
github: https://t.co/Z0Ok68cWhH pic.twitter.com/nmNGqO8IRy
— AK (@ak92501) September 22, 2021
Primer: Searching for Efficient Transformers for Language Modeling
abs: https://t.co/JM9v7pNoSI
github: https://t.co/xhA7uGyC7H
Experiments show Primer’s gains over Transformer increase as compute scale grows and follow a power law with respect to quality at optimal model sizes pic.twitter.com/CXq1yYMfUA
— AK (@ak92501) September 20, 2021
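To make the "follow a power law" phrasing concrete: a power-law relation between loss and training compute looks linear on log-log axes, so it can be read off by fitting a line to log(loss) versus log(compute). The numbers below are made up purely to show the fitting procedure, not results from the paper.

```python
import numpy as np

# Made-up (compute, validation loss) points; a power law L = a * C^(-b)
# appears as a straight line in log-log space.
compute = np.array([1e18, 1e19, 1e20, 1e21])
loss = np.array([3.5, 3.0, 2.6, 2.25])

slope, log_a = np.polyfit(np.log(compute), np.log(loss), 1)
print(f"fitted exponent: {slope:.3f}")          # negative slope: loss falls as compute grows
print(f"fitted prefactor: {np.exp(log_a):.3f}")
```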
Wish your neural networks were faster and more accurate?
Check out our recent EfficientNetV2 and CoAtNet, which significantly speed up training and inference while achieving state-of-the-art 90.88% top-1 accuracy on ImageNet. https://t.co/9buCSZmYby
— Mingxing Tan (@tanmingxing) September 16, 2021
An Empirical Study of GPT-3 for Few-Shot Knowledge-Based VQA
abs: https://t.co/A3s6zdW0Iu
PICa, a simple yet effective method that prompts GPT-3 via the use of image captions. Using only 16 examples, PICa surpasses the supervised SOTA by an absolute +8.6 points on the OK-VQA dataset pic.twitter.com/9HKybsk1qu
— AK (@ak92501) September 13, 2021
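A rough sketch of the prompting scheme the tweet describes: the image is first turned into a text caption, then caption, question, and answer triples from a handful of labeled examples are packed into a GPT-3 prompt ahead of the test question. The caption source, prompt wording, and example data here are placeholders I've assumed, not PICa's exact implementation.

```python
def build_prompt(in_context_examples, test_caption, test_question):
    """Assemble a few-shot prompt: caption + question + answer for each example,
    then the test image's caption and question with the answer left blank."""
    header = "Please answer the question according to the context.\n\n"
    blocks = []
    for ex in in_context_examples:                 # e.g. 16 labeled (caption, question, answer) triples
        blocks.append(
            f"Context: {ex['caption']}\n"
            f"Question: {ex['question']}\n"
            f"Answer: {ex['answer']}\n"
        )
    blocks.append(
        f"Context: {test_caption}\n"
        f"Question: {test_question}\n"
        f"Answer:"
    )
    return header + "\n".join(blocks)

# Hypothetical usage: captions would come from an off-the-shelf captioning model,
# and the prompt would be sent to GPT-3 via whatever completion API you have access to.
examples = [
    {"caption": "a red double-decker bus on a city street",
     "question": "what country is this bus most associated with?",
     "answer": "england"},
]
prompt = build_prompt(examples, "a man holding a surfboard on a beach",
                      "what is the water temperature likely to be?")
print(prompt)
```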
ConvMLP: Hierarchical Convolutional MLPs for Vision
pdf: https://t.co/f6c1XmyLSX
abs: https://t.co/vkgXvlCmcD
github: https://t.co/FyjNR8W3Oq pic.twitter.com/RXXJgzoula
— AK (@ak92501) September 10, 2021
Efficient Nearest Neighbor Language Models
pdf: https://t.co/uEtibkYA1L
abs: https://t.co/K0hBVhtvlk pic.twitter.com/ZKd0vnBDCs
— AK (@ak92501) September 10, 2021