What Language Model Architecture and Pretraining Objective Work Best for Zero-Shot Generalization?
— AK (@ak92501) April 13, 2022
abs: https://t.co/Lk71qAPdzm
github: https://t.co/hIzImwwFoD pic.twitter.com/NSI294Gs7M
Chinchilla: A 70 billion parameter language model that outperforms much larger models, including Gopher. By revisiting how to trade off compute between model & dataset size, users can train a better and smaller model. Read more: https://t.co/RaZGUclBYQ 1/3 pic.twitter.com/TNWI1RLloA
— DeepMind (@DeepMind) April 12, 2022
"Machine Learning State-of-the-Art with Uncertainties" -- great paper by @psteinb_ & @helmholtz_ai
— Sebastian Raschka (@rasbt) April 12, 2022
making a case for confidence intervals in ML benchmarks, or really any ML work. And no, adding CIs (e.g. via normal approximation) doesn't have to be expensive :) https://t.co/pgW6ILD7JW https://t.co/n1LMuCx1RQ
No Token Left Behind: Explainability-Aided Image Classification and Generation
— AK (@ak92501) April 12, 2022
abs: https://t.co/n5Jeu5Q8c7 pic.twitter.com/hLvkQgVFrr
Unleashing Vanilla Vision Transformer with Masked Image Modeling for Object Detection
— AK (@ak92501) April 7, 2022
abs: https://t.co/aL2vCMoyEp
github: https://t.co/xyk5vVRzvU pic.twitter.com/zFqHIngLwu
Temporal Alignment Networks for Long-term Video
— AK (@ak92501) April 7, 2022
abs: https://t.co/8VRuU21Lgg pic.twitter.com/wM72irpZQ5
MixFormer: Mixing Features across Windows and Dimensions
— AK (@ak92501) April 7, 2022
abs: https://t.co/3cLfqEzNfl pic.twitter.com/FUtMJS3p3o
KNN-Diffusion: Image Generation via Large-Scale Retrieval
— AK (@ak92501) April 7, 2022
abs: https://t.co/3E0f0wXBkI pic.twitter.com/78RHYZfpaC
PaLM: Scaling Language Modeling with Pathways
— AK (@ak92501) April 6, 2022
abs: https://t.co/yWvL0NGyjB pic.twitter.com/ACu4cVqAGO
MultiMAE: Multi-modal Multi-task Masked Autoencoders
— AK (@ak92501) April 5, 2022
abs: https://t.co/HrnyoHP9Xz
project page: https://t.co/NRdhfhYPCy pic.twitter.com/BADf1UMd3J
WavFT: Acoustic model finetuning with labelled and unlabelled data
— AK (@ak92501) April 4, 2022
abs: https://t.co/Feck7OBQ9i pic.twitter.com/DyfyXv24AF
XGBoost Is All You Need
— Bojan Tunguz (@tunguz) March 30, 2022
Deep Neural Networks and Tabular Data: A Survey https://t.co/Z2KsHP3fvp pic.twitter.com/uh5NLS1fVP