GENIE: Large Scale Pre-training for Text Generation with Diffusion Model
— AK (@_akhaliq) December 23, 2022
abs: https://t.co/7jEmtNaRzT pic.twitter.com/bhCrtjJYFH
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM https://t.co/hpKlpbDiLE #deeplearning #machinelearning #ml #ai #neuralnetworks #datascience #PyTorch
— PyTorch Best Practices (@PyTorchPractice) December 22, 2022
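For orientation, here is a minimal sketch of the training loop such a repo implements: a reward model is fit on human preference pairs, then the language model (PaLM in the repo; a toy LM here) is fine-tuned against that reward with a KL penalty toward the frozen pretrained policy. The module names, sizes, and the plain REINFORCE surrogate (the repo uses PPO) are all illustrative assumptions, not the repo's actual API.

```python
# Hedged sketch of an RLHF loop, not the repo's real code.
import torch
import torch.nn.functional as F
from torch import nn

VOCAB, DIM = 256, 64

class ToyLM(nn.Module):
    """Stand-in for PaLM: embeds tokens and predicts the next one."""
    def __init__(self):
        super().__init__()
        self.embed = nn.Embedding(VOCAB, DIM)
        self.head = nn.Linear(DIM, VOCAB)
    def forward(self, ids):                 # ids: (batch, seq)
        return self.head(self.embed(ids))   # logits: (batch, seq, vocab)

class RewardModel(nn.Module):
    """Maps a sequence to a scalar score; trained on preference pairs."""
    def __init__(self):
        super().__init__()
        self.embed = nn.Embedding(VOCAB, DIM)
        self.score = nn.Linear(DIM, 1)
    def forward(self, ids):
        return self.score(self.embed(ids).mean(dim=1)).squeeze(-1)

policy = ToyLM()                 # model being fine-tuned
ref = ToyLM()                    # frozen copy of the pretrained model
ref.load_state_dict(policy.state_dict())
for p in ref.parameters():
    p.requires_grad_(False)
reward_model = RewardModel()

# Stage 1: fit the reward model so human-preferred completions score
# higher (Bradley-Terry style pairwise loss); data here is random filler.
rm_opt = torch.optim.Adam(reward_model.parameters(), lr=1e-4)
chosen = torch.randint(0, VOCAB, (8, 16))    # placeholder preferred
rejected = torch.randint(0, VOCAB, (8, 16))  # placeholder dispreferred
rm_loss = -F.logsigmoid(reward_model(chosen) - reward_model(rejected)).mean()
rm_opt.zero_grad(); rm_loss.backward(); rm_opt.step()

# Stage 2: one simplified policy-gradient step on sampled completions,
# rewarded by the reward model minus a KL penalty toward the reference.
opt = torch.optim.Adam(policy.parameters(), lr=1e-4)
sample = torch.randint(0, VOCAB, (8, 16))    # placeholder sampled tokens
logp = F.log_softmax(policy(sample), -1).gather(-1, sample.unsqueeze(-1)).squeeze(-1)
logp_ref = F.log_softmax(ref(sample), -1).gather(-1, sample.unsqueeze(-1)).squeeze(-1)
reward = reward_model(sample) - 0.1 * (logp - logp_ref).sum(-1)  # KL-penalized
loss = -(logp.sum(-1) * reward.detach()).mean()  # REINFORCE surrogate
opt.zero_grad(); loss.backward(); opt.step()
```

In the real pipeline the sampled completions come from the policy itself and the update is clipped PPO rather than raw REINFORCE, but the shape of the loop is the same.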
X-Decoder: Generalized Decoding for Pixel, Image and Language
— AK (@_akhaliq) December 22, 2022
Hugging Face demo: https://t.co/TanVA1vBow
abs: https://t.co/3AtipzRWJT
project page: https://t.co/aqVdA3akGY
github: https://t.co/GNWGTBKd65 pic.twitter.com/0o03447dqy
Deep Learning won't replace radiologists any time soon, but it sure looks like it's helping them and their patients. https://t.co/QB141zS292
— Yann LeCun (@ylecun) December 21, 2022
LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer
— AK (@_akhaliq) December 21, 2022
abs: https://t.co/tUqIq7xpg7 pic.twitter.com/G2j8qmFAfV
Holiday reading list 📚🎁 Part 2
— Fermat's Library (@fermatslibrary) December 20, 2022
"The more comfortable we become with being stupid, the deeper we will wade into the unknown and the more likely we are to make big discoveries."
"The importance of stupidity in scientific research", an insightful read: https://t.co/iE404kMDkI pic.twitter.com/VF2vENr1fq
Blog Post (w/ @gail_w): On "Thinking Like Transformers"
— Sasha Rush (@srush_nlp) December 20, 2022
In which I get a bit obsessed with learning how to code in Transformer lang🤖. https://t.co/Nb6G52vuiK
(You can follow along or do the exercises yourself in a colab notebook.) pic.twitter.com/Hk4aIxQr7l
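The post builds programs out of two attention-shaped primitives: `select` (which key positions each query position attends to) and `aggregate` (average the values at the selected positions). Below is a tiny list-based interpreter of those two primitives, written from scratch for illustration rather than taken from the post's RASPy library, applied to the classic sequence-reversal example:

```python
# Hedged, self-contained sketch of the two RASP-style primitives; the
# function names mirror RASP, but this interpreter is my own illustration.

def select(keys, queries, predicate):
    """Boolean attention matrix: sel[q][k] = predicate(keys[k], queries[q])."""
    return [[predicate(k, q) for k in keys] for q in queries]

def aggregate(selector, values):
    """Each query position averages the values it attends to."""
    out = []
    for row in selector:
        picked = [v for v, on in zip(values, row) if on]
        out.append(sum(picked) / len(picked) if picked else 0)
    return out

tokens = [5, 1, 4, 2, 3]
indices = list(range(len(tokens)))
n = len(tokens)  # RASP computes length with select/aggregate; hardcoded here

# Reverse a sequence: query position i attends to key position n-1-i.
flip = select(indices, indices, lambda k, q: k == n - 1 - q)
print(aggregate(flip, tokens))  # [3.0, 2.0, 4.0, 1.0, 5.0]
```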
Optimizing Prompts for Text-to-Image Generation
— AK (@_akhaliq) December 20, 2022
abs: https://t.co/RIagcSFRgx
github: https://t.co/aqvAKuyryJ
Hugging Face: https://t.co/L0tuCre89O pic.twitter.com/lNrKKhiUBN
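The paper fine-tunes a language model, first with supervised learning and then with RL against image-quality rewards, to rewrite a plain prompt into one that text-to-image models render better. A hedged usage sketch follows; the checkpoint id `microsoft/Promptist` and the `" Rephrase:"` input suffix are assumptions about the public release, so verify them against the linked repo and demo before relying on this.

```python
# Hedged sketch: load an assumed prompt-optimizer checkpoint and rewrite
# a plain prompt. Model id and input format are assumptions, not confirmed.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "microsoft/Promptist"  # assumed checkpoint id
tok = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

plain = "a cat sitting on a windowsill"
inputs = tok(plain + " Rephrase:", return_tensors="pt")  # assumed format
out = model.generate(
    **inputs,
    max_new_tokens=75,
    do_sample=False,
    num_beams=4,  # beam search for a deterministic rewrite
)
optimized = tok.decode(out[0][inputs["input_ids"].shape[1]:],
                       skip_special_tokens=True)
print(optimized)  # expected: the prompt plus style modifiers
```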
Scalable Diffusion Models with Transformers
— AK (@_akhaliq) December 20, 2022
abs: https://t.co/RlOulZLZ1U
The largest DiT-XL/2 models outperform all prior diffusion models on the class-conditional ImageNet 512×512 and 256×256 benchmarks, achieving a state-of-the-art FID of 2.27 on the latter pic.twitter.com/lGVFXn9keN
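The paper's best-performing variant conditions each transformer block on the timestep and class embedding through adaLN-Zero: a linear layer regresses per-block shift, scale, and gate vectors, with the gates zero-initialized so every block starts as the identity. Here is a minimal single-block sketch; the dimensions are illustrative, and the real models stack many such blocks over patchified latents.

```python
# Minimal sketch of a DiT-style block with adaLN-Zero conditioning.
import torch
from torch import nn

class DiTBlock(nn.Module):
    def __init__(self, dim=384, heads=6, mlp_ratio=4):
        super().__init__()
        self.norm1 = nn.LayerNorm(dim, elementwise_affine=False)
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.norm2 = nn.LayerNorm(dim, elementwise_affine=False)
        self.mlp = nn.Sequential(
            nn.Linear(dim, mlp_ratio * dim), nn.GELU(),
            nn.Linear(mlp_ratio * dim, dim),
        )
        # adaLN-Zero: regress 6 modulation vectors (shift/scale/gate x2)
        # from the conditioning embedding; zero-init => identity at start.
        self.ada = nn.Linear(dim, 6 * dim)
        nn.init.zeros_(self.ada.weight)
        nn.init.zeros_(self.ada.bias)

    def forward(self, x, c):  # x: (B, N, dim) tokens, c: (B, dim) cond
        s1, b1, g1, s2, b2, g2 = self.ada(c).chunk(6, dim=-1)
        h = self.norm1(x) * (1 + s1.unsqueeze(1)) + b1.unsqueeze(1)
        x = x + g1.unsqueeze(1) * self.attn(h, h, h, need_weights=False)[0]
        h = self.norm2(x) * (1 + s2.unsqueeze(1)) + b2.unsqueeze(1)
        return x + g2.unsqueeze(1) * self.mlp(h)

block = DiTBlock()
x = torch.randn(2, 16, 384)  # 16 latent patch tokens
c = torch.randn(2, 384)      # timestep + class embedding
print(block(x, c).shape)     # torch.Size([2, 16, 384])
```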
Pretty eerie: AI models learn to reflect user views back at them (presumably because low loss rewards modeling the _context_ of whatever emitted the input tokens). Pretty weird to see it in the wild. LLMs seek to reflect the views of the people that talk to them. https://t.co/gD5qbAwRUb
— Jack Clark (@jackclarkSF) December 19, 2022
Teaching Small Language Models to Reason
— AK (@_akhaliq) December 19, 2022
abs: https://t.co/FVIko6wVez pic.twitter.com/bhU8maOlVk
ALERT: Adapting Language Models to Reasoning Tasks
— AK (@_akhaliq) December 19, 2022
abs: https://t.co/fC9hNJpKvc pic.twitter.com/TSbYLZYO2n