Tweeted By @ak92501
VLMO: Unified Vision-Language Pre-Training with Mixture-of-Modality-Experts
abs: https://t.co/Rv9o8aFIdI
introduce the Mixture-of-Modality-Experts Transformer, where each block contains a pool of modality-specific experts and a shared self-attention layer
— AK (@ak92501) November 4, 2021
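To make the block structure concrete, here is a minimal sketch of such a mixture-of-modality-experts block in PyTorch. It is an illustration based only on the tweet's description (shared self-attention plus per-modality feed-forward experts), not the paper's actual implementation; the class name `MoMEBlock`, the expert names, the pre-norm layout, and all hyperparameters are assumptions.

```python
import torch
import torch.nn as nn

class MoMEBlock(nn.Module):
    """Illustrative mixture-of-modality-experts Transformer block.

    All tokens pass through one shared self-attention layer; the
    feed-forward network is replaced by a pool of modality-specific
    experts, selected per forward pass. Expert names are hypothetical.
    """
    def __init__(self, dim=768, num_heads=12, mlp_ratio=4):
        super().__init__()
        self.norm1 = nn.LayerNorm(dim)
        self.attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.norm2 = nn.LayerNorm(dim)
        # One feed-forward expert per modality (assumed split).
        self.experts = nn.ModuleDict({
            name: nn.Sequential(
                nn.Linear(dim, dim * mlp_ratio),
                nn.GELU(),
                nn.Linear(dim * mlp_ratio, dim),
            )
            for name in ("vision", "language", "vl")
        })

    def forward(self, x, modality="vl"):
        # Shared self-attention over all tokens, regardless of modality.
        h = self.norm1(x)
        attn_out, _ = self.attn(h, h, h, need_weights=False)
        x = x + attn_out
        # Route through the expert matching the input modality.
        x = x + self.experts[modality](self.norm2(x))
        return x

# Example: a batch of fused image-text tokens routed through the
# vision-language expert.
block = MoMEBlock()
tokens = torch.randn(2, 197, 768)   # (batch, sequence, dim)
out = block(tokens, modality="vl")
print(out.shape)                    # torch.Size([2, 197, 768])
```

The point of the design, as the tweet frames it, is that attention parameters are shared across modalities while the feed-forward capacity is modality-specific, so the same block can serve image-only, text-only, or fused inputs by switching experts.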