Tweeted By @ml_review
Augmenting Self-attention with Persistent Memory
w/ @exgrv @GuillaumeLample
Replaces the feed-forward layer with persistent memory vectors. Reduces the memory footprint of a transformer while preserving performance.
https://t.co/2BcXQupBjt pic.twitter.com/2GlSGF9bzW
— ML Review (@ml_review) August 25, 2019
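The idea in the tweet can be sketched in code: learned "persistent" key/value vectors are concatenated to the self-attention context so the attention layer can take over the role of the feed-forward sublayer. Below is a minimal single-head, PyTorch-style sketch; the module name and the hyperparameters (`d_model`, `n_persistent`) are illustrative assumptions, not the authors' implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class PersistentMemoryAttention(nn.Module):
    """Self-attention augmented with learned persistent key/value vectors.

    Sketch only: the persistent vectors act as extra, input-independent
    "virtual tokens" the layer can attend to, which is the mechanism the
    tweet describes as replacing the feed-forward sublayer.
    """

    def __init__(self, d_model: int, n_persistent: int):
        super().__init__()
        self.q_proj = nn.Linear(d_model, d_model)
        self.k_proj = nn.Linear(d_model, d_model)
        self.v_proj = nn.Linear(d_model, d_model)
        self.out_proj = nn.Linear(d_model, d_model)
        # Persistent memory: learned keys/values shared across all inputs.
        self.persistent_k = nn.Parameter(torch.randn(n_persistent, d_model) * d_model ** -0.5)
        self.persistent_v = nn.Parameter(torch.randn(n_persistent, d_model) * d_model ** -0.5)
        self.scale = d_model ** -0.5

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, seq_len, d_model)
        batch = x.size(0)
        q = self.q_proj(x)
        # Concatenate persistent vectors to the context keys and values.
        pk = self.persistent_k.unsqueeze(0).expand(batch, -1, -1)
        pv = self.persistent_v.unsqueeze(0).expand(batch, -1, -1)
        k = torch.cat([self.k_proj(x), pk], dim=1)
        v = torch.cat([self.v_proj(x), pv], dim=1)
        # Standard scaled dot-product attention over context + persistent memory.
        attn = F.softmax(q @ k.transpose(-2, -1) * self.scale, dim=-1)
        return self.out_proj(attn @ v)
```

Because the persistent vectors are ordinary parameters rather than per-layer feed-forward weight matrices, a block built this way needs only the attention projections plus `n_persistent` key/value rows, which is where the memory savings mentioned in the tweet would come from.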