MAGVIT: Masked Generative Video Transformer
— AK (@_akhaliq) December 13, 2022
abs: https://t.co/LOuF71lgLl
project page: https://t.co/o1uf6BsCbB pic.twitter.com/rHfhtUqeBB
MAGVIT: Masked Generative Video Transformer
— AK (@_akhaliq) December 13, 2022
abs: https://t.co/LOuF71lgLl
project page: https://t.co/o1uf6BsCbB pic.twitter.com/rHfhtUqeBB
Seeing a Rose in Five Thousand Ways
— AK (@_akhaliq) December 12, 2022
abs: https://t.co/g3DPhg4FLY pic.twitter.com/GUZWdmjmQF
VideoDex: Learning Dexterity from Internet Videos
— AK (@_akhaliq) December 9, 2022
abs: https://t.co/j9g9QJ8X2L
project page: https://t.co/2CwgvxlDqo pic.twitter.com/5FunjX44NY
SINE: SINgle Image Editing with Text-to-Image Diffusion Models
— AK (@_akhaliq) December 9, 2022
abs: https://t.co/t9DOAdJUgU
project page: https://t.co/G4PGZCKeGQ pic.twitter.com/H3ugxkPaR5
Multi-Concept Customization of Text-to-Image Diffusion
— AK (@_akhaliq) December 9, 2022
abs: https://t.co/SVbarg9tGE
project page: https://t.co/17oBqIWUBT pic.twitter.com/h7DNQxvMqg
ADIR: Adaptive Diffusion for Image Reconstruction
— AK (@_akhaliq) December 7, 2022
abs: https://t.co/Nu3gci1obX
project page: https://t.co/6VZRYYAAxZ pic.twitter.com/5KiZFxqFrg
Image Deblurring with Domain Generalizable Diffusion Models
— AK (@_akhaliq) December 6, 2022
abs: https://t.co/nXYfjxncpc pic.twitter.com/iF1O0CMXik
GRiT: A Generative Region-to-text Transformer for Object Understanding
— AK (@_akhaliq) December 2, 2022
abs: https://t.co/unwWFbacj7
github: https://t.co/B4pT5zoWyD pic.twitter.com/ORHqjJ8yEC
RGB no more: Minimally-decoded JPEG Vision Transformers
— AK (@_akhaliq) November 30, 2022
abs: https://t.co/w1WWGbPe5B pic.twitter.com/aEov50TPsD
Magic Poser + Stable Diffusion 2.0’s Depth-to-Image
— hardmaru (@hardmaru) November 26, 2022
A blog post detailing the workflow of combining a pose maker software with Stable Diffusion 2.0’s Depth-to-Image model to have full control over the generated image.#StableDiffusion #StableDiffusion2https://t.co/VjtRZIfaBL pic.twitter.com/MGzG9cLJhW
Retrieval-Augmented Multimodal Language Modeling
— AK (@_akhaliq) November 24, 2022
abs: https://t.co/qSRxVEy546 pic.twitter.com/XLnBeDRoqN
Latent Video Diffusion Models for High-Fidelity Video Generation with Arbitrary Lengths
— AK (@_akhaliq) November 24, 2022
abs: https://t.co/TKMdZJTuef
project page: https://t.co/GBj7G7ddOT pic.twitter.com/BF08kAA2Yo