Delta Tuning: A Comprehensive Study of Parameter Efficient Methods for Pre-trained Language Models
abs: https://t.co/8cPUIX2Rfh pic.twitter.com/nPOErYQcfH
— AK (@ak92501) March 17, 2022
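For readers unfamiliar with the term, "delta tuning" covers methods that keep the pre-trained backbone frozen and update only a small set of extra ("delta") parameters. Below is a minimal sketch of one such method, a bottleneck adapter; the sizes, naming convention, and placement are illustrative assumptions, not taken from the paper's taxonomy.

```python
# Minimal sketch of one delta-tuning family: a residual bottleneck adapter
# trained while the backbone stays frozen. Dimensions are assumptions.
import torch.nn as nn

class Adapter(nn.Module):
    def __init__(self, dim=768, bottleneck=64):
        super().__init__()
        self.down = nn.Linear(dim, bottleneck)
        self.up = nn.Linear(bottleneck, dim)
        self.act = nn.GELU()

    def forward(self, hidden):
        # Residual "delta" added on top of the frozen backbone's hidden states.
        return hidden + self.up(self.act(self.down(hidden)))

def freeze_backbone(model):
    # Train only parameters whose names contain "adapter" (naming is a
    # hypothetical convention for this sketch).
    for name, p in model.named_parameters():
        p.requires_grad = "adapter" in name
```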
Introducing the Multimodal Bottleneck Transformer, a novel transformer-based model for multimodal fusion that restricts cross-modal attention flow to achieve state-of-the-art results on video classification tasks with less compute. Read more ↓ https://t.co/BXMVgap0ID pic.twitter.com/Pb8b3j1A5N
— Google AI (@GoogleAI) March 15, 2022
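As a rough illustration of the idea described in the tweet (cross-modal attention restricted to a small set of shared bottleneck tokens rather than full pairwise cross-attention), here is a minimal sketch of one fusion layer. The layer sizes, the use of nn.TransformerEncoderLayer, and the simple averaging of the bottleneck updates are assumptions for illustration, not the paper's exact architecture.

```python
# Minimal sketch of bottleneck fusion between two modalities (e.g. video/audio).
# Each modality attends only to its own tokens plus shared bottleneck tokens;
# cross-modal information can flow only through that bottleneck.
import torch
import torch.nn as nn

class BottleneckFusionLayer(nn.Module):
    def __init__(self, dim=256, heads=4, num_bottlenecks=4):
        super().__init__()
        self.bottleneck = nn.Parameter(torch.randn(1, num_bottlenecks, dim))
        self.video_layer = nn.TransformerEncoderLayer(dim, heads, batch_first=True)
        self.audio_layer = nn.TransformerEncoderLayer(dim, heads, batch_first=True)

    def forward(self, video_tokens, audio_tokens):
        b = video_tokens.size(0)
        z = self.bottleneck.expand(b, -1, -1)
        n = z.size(1)
        # Each modality processes [own tokens ; bottleneck tokens] only.
        v_out = self.video_layer(torch.cat([video_tokens, z], dim=1))
        a_out = self.audio_layer(torch.cat([audio_tokens, z], dim=1))
        video_tokens, z_v = v_out[:, :-n], v_out[:, -n:]
        audio_tokens, z_a = a_out[:, :-n], a_out[:, -n:]
        # Merge the two bottleneck updates (simple average in this sketch).
        z = 0.5 * (z_v + z_a)
        return video_tokens, audio_tokens, z
```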
“Model soups”: Averaging the weights of multiple models fine-tuned with different hyperparameter configurations improves accuracy and robustness, without increasing inference time! @mitchnw et al. https://t.co/QJ4f4MvTHu
— hardmaru (@hardmaru) March 13, 2022
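The recipe is simple enough to sketch directly: fine-tune several copies of the same model with different hyperparameters, then average their weights element-wise before inference. The helper below is a minimal "uniform soup" sketch in PyTorch; the checkpoint filenames in the usage comment are hypothetical.

```python
# Minimal sketch of a uniform model soup: element-wise average of several
# state_dicts that share the same keys and shapes.
import torch

def uniform_soup(state_dicts):
    """Average a list of state_dicts from fine-tuned copies of one model."""
    soup = {}
    for key in state_dicts[0]:
        # Note: integer buffers (e.g. BatchNorm counters) would need special
        # handling; this sketch casts everything to float.
        soup[key] = torch.stack([sd[key].float() for sd in state_dicts]).mean(dim=0)
    return soup

# Usage (hypothetical checkpoint files fine-tuned with different hyperparameters):
# checkpoints = [torch.load(p, map_location="cpu") for p in ["ft_lr1e-5.pt", "ft_lr3e-5.pt"]]
# model.load_state_dict(uniform_soup(checkpoints))
```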
This paper has a very clear presentation of different attention architectures in transformers. I’d be thankful if people could share their experience in trying multi-query vs standard multi-head attention. Thanks https://t.co/aY1AW5etWI
— Nando de Freitas 🏳️🌈 (@NandoDF) March 13, 2022
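For context on the comparison being asked about: multi-query attention keeps per-head query projections but shares a single key/value head across all heads, which mainly shrinks the key/value cache at decoding time. A minimal sketch, with illustrative dimensions:

```python
# Minimal sketch of multi-query attention: per-head queries, one shared
# key/value head broadcast across all heads.
import torch
import torch.nn as nn

class MultiQueryAttention(nn.Module):
    def __init__(self, dim=512, num_heads=8):
        super().__init__()
        self.num_heads = num_heads
        self.head_dim = dim // num_heads
        self.q_proj = nn.Linear(dim, dim)            # per-head queries
        self.k_proj = nn.Linear(dim, self.head_dim)  # single shared key head
        self.v_proj = nn.Linear(dim, self.head_dim)  # single shared value head
        self.out_proj = nn.Linear(dim, dim)

    def forward(self, x):
        b, t, _ = x.shape
        q = self.q_proj(x).view(b, t, self.num_heads, self.head_dim).transpose(1, 2)  # (b, h, t, d)
        k = self.k_proj(x).unsqueeze(1)  # (b, 1, t, d), broadcast over heads
        v = self.v_proj(x).unsqueeze(1)
        attn = torch.softmax(q @ k.transpose(-2, -1) / self.head_dim ** 0.5, dim=-1)
        out = (attn @ v).transpose(1, 2).reshape(b, t, -1)
        return self.out_proj(out)
```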
Temporal Difference Learning for Model Predictive Control
abs: https://t.co/WaBa3e5J0s
project page: https://t.co/30YjYkXjwW pic.twitter.com/Yz0Wg4lGDO
— AK (@ak92501) March 10, 2022
On the surprising tradeoff between ImageNet accuracy and perceptual similarity
abs: https://t.co/FAWkG1OIX5
show that an inverse-U relationship exists between accuracy and perceptual similarity across a number of settings pic.twitter.com/GHnj2MJUgP
— AK (@ak92501) March 10, 2022
EdgeFormer: Improving Light-weight ConvNets by Learning from Vision Transformers
abs: https://t.co/Ju4EJMasSZ pic.twitter.com/wZSb4v8ZVv
— AK (@ak92501) March 9, 2022
The (Un)Surprising Effectiveness of Pre-Trained Vision Models for Control
abs: https://t.co/kFVZx80f2u pic.twitter.com/Tm723A7aqC
— AK (@ak92501) March 8, 2022
DiT: Self-supervised Pre-training for Document Image Transformer
abs: https://t.co/OUQ94iQ6dY
achieves sota results on downstream tasks, e.g. document image classification (91.11 → 92.69), document layout analysis (91.0 → 94.9) and table detection (94.23 → 96.55) pic.twitter.com/uZWAMGh71s
— AK (@ak92501) March 7, 2022
TableFormer: Table Structure Understanding with Transformers
abs: https://t.co/RiMdYmdstj pic.twitter.com/gapwX8EgKz
— AK (@ak92501) March 3, 2022
HyperPrompt: Prompt-based Task-Conditioning of Transformers
abs: https://t.co/OOQLIlBIv3
HyperPrompt achieves sota performance on SuperGLUE for T5 models up to XXL pic.twitter.com/Ic1XlqZiqO
— AK (@ak92501) March 3, 2022
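As a generic illustration of prompt-based task conditioning (the idea in the title, not the paper's specific HyperPrompt mechanism), here is a minimal sketch in which a learned table of per-task prompt embeddings is prepended to the input before a shared transformer; all names and sizes are illustrative assumptions.

```python
# Minimal sketch of per-task learned prompts prepended to token embeddings.
import torch
import torch.nn as nn

class TaskPrompts(nn.Module):
    def __init__(self, num_tasks, prompt_len=8, dim=512):
        super().__init__()
        # One block of learnable prompt vectors per task.
        self.prompts = nn.Parameter(torch.randn(num_tasks, prompt_len, dim) * 0.02)

    def forward(self, token_embeddings, task_id):
        b = token_embeddings.size(0)
        prompt = self.prompts[task_id].unsqueeze(0).expand(b, -1, -1)
        # The shared transformer then runs on [task prompt ; input tokens].
        return torch.cat([prompt, token_embeddings], dim=1)
```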
Introducing a new approach for training #ML models using noisy data that works by dynamically assigning importance weights to both individual instances and class labels, thus reducing the impact of noisy examples. Learn more about it at https://t.co/lKYl0fzeYD pic.twitter.com/ySCm1HAzKT
— Google AI (@GoogleAI) February 28, 2022
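One common way to realize instance-level importance weighting is to scale each example's loss by a weight that shrinks for examples that look mislabeled. The sketch below uses a softmax over negative per-example losses as that weight; this particular weighting rule and the temperature parameter are illustrative assumptions, not the scheme from the linked post.

```python
# Minimal sketch of instance reweighting under label noise: high-loss
# (likely mislabeled) examples are down-weighted in the training objective.
import torch
import torch.nn.functional as F

def reweighted_cross_entropy(logits, labels, temperature=1.0):
    per_example = F.cross_entropy(logits, labels, reduction="none")
    # Weights sum to the batch size, so the scale matches unweighted training.
    weights = torch.softmax(-per_example.detach() / temperature, dim=0) * labels.size(0)
    return (weights * per_example).mean()
```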