Multi-Concept Customization of Text-to-Image Diffusion
— AK (@_akhaliq) December 9, 2022
abs: https://t.co/SVbarg9tGE
project page: https://t.co/17oBqIWUBT pic.twitter.com/h7DNQxvMqg
Unifying Vision, Text, and Layout for Universal Document Processing
— AK (@_akhaliq) December 7, 2022
abs: https://t.co/PEVsbf2cL0 pic.twitter.com/EkV3ssJ9Pd
Extensible Prompts for Language Models
— AK (@_akhaliq) December 2, 2022
abs: https://t.co/1Xq7BjENah pic.twitter.com/dkWyOHMLlP
Coder Reviewer Reranking for Code Generation
— AK (@_akhaliq) November 30, 2022
abs: https://t.co/0xaHE5jz4U pic.twitter.com/jKZHgKz409
Retrieval-Augmented Multimodal Language Modeling
— AK (@_akhaliq) November 24, 2022
abs: https://t.co/qSRxVEy546 pic.twitter.com/XLnBeDRoqN
Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation
— AK (@_akhaliq) November 24, 2022
abs: https://t.co/0woOqb8Y2J
project page: https://t.co/DgUiEEdmaY pic.twitter.com/BtIaAiMZOL
Kandinsky 2.0 - multilingual text2image latent diffusion model
— AK (@_akhaliq) November 23, 2022
Kandinsky 2.0 was trained on a large 1B multilingual set
@huggingface model: https://t.co/dehtM8GABm
github: https://t.co/ALuydTezHj pic.twitter.com/26OC4twFKm
Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition
— AK (@_akhaliq) November 23, 2022
abs: https://t.co/A4bmxam8p0
project page: https://t.co/CDIpU9hVGK pic.twitter.com/YbuDvkHVKl
Human Evaluation of Text-to-Image Models on a Multi-Task Benchmark
— AK (@_akhaliq) November 23, 2022
abs: https://t.co/CgS0RDlI4U pic.twitter.com/A7Tplszo86
ClipCrop: Conditioned Cropping Driven by Vision-Language Model
— AK (@_akhaliq) November 22, 2022
abs: https://t.co/CzRBL0hgaz pic.twitter.com/IdfYMYGOeX
Synthesizing Coherent Story with Auto-Regressive Latent Diffusion Models
— AK (@_akhaliq) November 22, 2022
abs: https://t.co/Id4oqu0omU pic.twitter.com/hpmr6EtL8z
I Can't Believe There's No Images! Learning Visual Tasks Using only Language Data
— AK (@_akhaliq) November 18, 2022
abs: https://t.co/fxIgCaiDES pic.twitter.com/YzhEQ2CoTr