Tweeted By @ak92501
CogView: Mastering Text-to-Image Generation via Transformers
— AK (@ak92501) May 28, 2021
pdf: https://t.co/SOIqFhHleW
abs: https://t.co/nmhinD6kXK
demo: https://t.co/QULIeN628P
a 4-billion-parameter Transformer with VQ-VAE tokenizer, achieves a new SOTA FID on blurred MS COCO pic.twitter.com/hvXheo5cYP