GPT-NeoX-20B, 20 billion parameter large language model made freely available to public, with candid report on strengths, limits, ecological costs, etc.
- Gary Marcus (@GaryMarcus) February 10, 2022
Genuinely Open AI https://t.co/kwcCtNQiGD
GiraffeDet: A Heavy-Neck Paradigm for Object Detection
- AK (@ak92501) February 10, 2022
abs: https://t.co/Mrtg47qSpG pic.twitter.com/OHpF2tcGmq
Context Autoencoder for Self-Supervised Representation Learning
- AK (@ak92501) February 8, 2022
abs: https://t.co/ziVP3wwUAo pic.twitter.com/02PN3y6hp9
Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
- AK (@ak92501) February 8, 2022
abs: https://t.co/UlC8dDNdMm pic.twitter.com/pLIrOyQOTp
This is sweet 🥧! https://t.co/fUarOS3CNn
- Martin Görner (@martin_gorner) February 4, 2022
Finally a solid way of teaching a neural network to know what it does not know.
(OOD = Out Of Domain, i.e. not one of the classes in the training data.) Congrats @SharonYixuanLin @xuefeng_du @MuCai7 pic.twitter.com/x3DEX7Y6Hl
CoST: Contrastive Learning of Disentangled Seasonal-Trend Representations for Time Series Forecasting
- AK (@ak92501) February 4, 2022
abs: https://t.co/4jk8lps3pJ pic.twitter.com/WJmZRWrYMk
ETSformer: Exponential Smoothing Transformers for Time-series Forecasting
- AK (@ak92501) February 4, 2022
abs: https://t.co/ZtpXPqhlhF pic.twitter.com/dSPGXgcAid
Pre-Trained Language Models for Interactive Decision-Making
- AK (@ak92501) February 4, 2022
abs: https://t.co/uECv8kutrE
project page: https://t.co/Bf3iqgfcA9 pic.twitter.com/OLSIiOxX2S
Unified Scaling Laws for Routed Language Models
- AK (@ak92501) February 3, 2022
abs: https://t.co/C4zMJcB2wg pic.twitter.com/LoKuIVW617
Competition-Level Code Generation with AlphaCode
- AK (@ak92501) February 2, 2022
paper: https://t.co/Np8uy6UE3R
blog: https://t.co/ATpcgHNeGB pic.twitter.com/x3iGv5UjBM
WebFormer: The Web-page Transformer for Structure Information Extraction
- AK (@ak92501) February 2, 2022
abs: https://t.co/d6y4TEFw2h pic.twitter.com/CgMiVVAtyS
COIN++: Data Agnostic Neural Compression
- AK (@ak92501) February 1, 2022
abs: https://t.co/BvWvL962Vg pic.twitter.com/bIEMWjciJb