MMDialog: A Large-scale Multi-turn Dialogue Dataset Towards Multi-modal Open-domain Conversation
— AK (@_akhaliq) November 11, 2022
abs: https://t.co/dYJRDP1xCU
github: https://t.co/j95Y0of9GJ pic.twitter.com/DrH4rp5EKc
What the DAAM: Interpreting Stable Diffusion Using Cross Attention
— AK (@_akhaliq) November 11, 2022
abs: https://t.co/Y3vYiZRcID
github: https://t.co/OUQPlQ5a5F
@gradio demo: https://t.co/Feu06pQOix pic.twitter.com/PLBwanJOnv
ZerO Initialization: Initializing Neural Networks with only Zeros and Ones
— hardmaru (@hardmaru) November 11, 2022
A fully deterministic initialization scheme which sets the weights to only 0s and 1s can achieve SOTA on various datasets including ImageNet. Maybe random weights are unnecessary.
https://t.co/t6u3S6Dj71 pic.twitter.com/XgszDvat6T
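The idea above can be sketched minimally: a deterministic weight matrix built only from 0s and 1s, here a (partial) identity. This is an illustrative simplification, not the paper's full scheme (ZerO also handles dimension changes with Hadamard-based variants); the function name is hypothetical.

```python
import numpy as np

def zero_one_init(fan_in, fan_out):
    """Deterministic initialization using only 0s and 1s:
    a (partial) identity matrix. Simplified illustration of the
    zeros-and-ones idea, not the paper's complete method."""
    w = np.zeros((fan_in, fan_out))
    for i in range(min(fan_in, fan_out)):
        w[i, i] = 1.0  # ones on the diagonal, zeros everywhere else
    return w

w = zero_one_init(4, 3)  # every entry is exactly 0 or 1, no randomness
```

Because the scheme uses no random numbers, two runs produce bit-identical networks, which also makes training fully reproducible.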
StyleNAT: Giving Each Head a New Perspective
— AK (@_akhaliq) November 11, 2022
abs: https://t.co/3MTb1Nwtqn
github: https://t.co/EmY1eskFlP pic.twitter.com/pzQ8QWnk5Y
Creative Writing with an AI-Powered Writing Assistant: Perspectives from Professional Writers
— AK (@_akhaliq) November 10, 2022
abs: https://t.co/hcNMnv39df pic.twitter.com/XzEYOUgA8U
"Data Models for Dataset Drift Controls in Machine Learning With Images"
— Bojan Tunguz (@tunguz) November 9, 2022
Paper: https://t.co/uYxu74SIeg
Code: https://t.co/zpG4oo7bBx
Dataset: https://t.co/ohrAEQA1nE
#MachineLearning #DeepLearning #ArtificialIntelligence #ML #DL #AI
How Much Does Attention Actually Attend? Questioning the Importance of Attention in Pretrained Transformers
— AK (@_akhaliq) November 8, 2022
abs: https://t.co/1jfwlUlsHt pic.twitter.com/BdUs7uMnSx
Rickrolling the Artist: Injecting Invisible Backdoors into Text-Guided Image Generation Models
— AK (@_akhaliq) November 7, 2022
abs: https://t.co/GaRXISObfs
github: https://t.co/l2mroWbLwT pic.twitter.com/CIz4kOW9Bx
Large Language Models Are Human-Level Prompt Engineers
— AK (@_akhaliq) November 4, 2022
abs: https://t.co/eqNfe2o4JQ
project page: https://t.co/tusmJehApu pic.twitter.com/Vw7UU36XJF
eDiffi: Text-to-Image Diffusion Models with an Ensemble of Expert Denoisers
— AK (@_akhaliq) November 3, 2022
abs: https://t.co/QaJfNzXeLL
project page: https://t.co/UrYmzGek8C
SOTA text-to-image diffusion model consisting of a base diffusion model and two super-resolution modules, producing 1024 × 1024 high-definition outputs pic.twitter.com/VN9qWla54v
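The cascade described above (base model plus two super-resolution stages) can be sketched as a simple pipeline. All names and the 64 → 256 → 1024 resolution schedule here are assumptions for illustration; this is not the eDiffi implementation.

```python
# Hypothetical sketch of a cascaded text-to-image pipeline:
# a base diffusion model samples a low-resolution image, then two
# super-resolution stages upscale it to the final 1024x1024 output.
def cascaded_generate(prompt, base_model, sr_stage1, sr_stage2):
    image_lo = base_model(prompt)              # base sample (e.g. 64x64)
    image_mid = sr_stage1(prompt, image_lo)    # first super-res stage
    image_hi = sr_stage2(prompt, image_mid)    # second super-res stage
    return image_hi                            # high-definition output
```

Each stage is conditioned on the text prompt as well as the previous stage's output, which is the usual structure of cascaded diffusion pipelines.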
On the detection of synthetic images generated by diffusion models
— AK (@_akhaliq) November 3, 2022
abs: https://t.co/TcyViS0RPH pic.twitter.com/1igXfAenCm
Text-Only Training for Image Captioning using Noise-Injected CLIP
— AK (@_akhaliq) November 2, 2022
abs: https://t.co/g4ZERgWtyW
github: https://t.co/HIpuHqCECx pic.twitter.com/Kxl4vHWHqx