by _akhaliq on 2022-09-23 (UTC).

Mega: Moving Average Equipped Gated Attention
abs: https://t.co/HxcNRQFg16 pic.twitter.com/gFuvKJOYvy

— AK (@_akhaliq) September 23, 2022
research
by gneubig on 2022-09-23 (UTC).

MEGA is a new method for modeling long sequences based on the surprisingly simple technique of taking the moving average of embeddings.

Excellent results, outperforming strong competitors such as S4 on most tasks! Strongly recommend that you check it out: https://t.co/Y07MSd0hhc https://t.co/9xbmaF5XCr pic.twitter.com/pX01RctxDN

— Graham Neubig (@gneubig) September 23, 2022
research nlp
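
To make the "moving average of embeddings" idea concrete, here is a minimal sketch of a plain exponential moving average (EMA) applied along the sequence axis. It is an illustration only, not the paper's implementation: the function name `ema_embeddings`, the single fixed smoothing factor `alpha`, and the zero initial state are all assumptions, and the variant used in MEGA may differ (for example, by learning its parameters).

```python
import numpy as np

def ema_embeddings(x, alpha=0.5):
    """Exponential moving average over a sequence of embeddings.

    x:     array of shape (seq_len, dim), one embedding per position.
    alpha: smoothing factor in (0, 1]; hypothetical fixed value,
           not a setting taken from the paper.

    Computes y[t] = alpha * x[t] + (1 - alpha) * y[t-1], starting from
    a zero state (an assumption made for this sketch).
    """
    x = np.asarray(x, dtype=float)
    y = np.zeros_like(x)
    prev = np.zeros(x.shape[1])
    for t in range(x.shape[0]):
        # Blend the current embedding with the running average.
        prev = alpha * x[t] + (1.0 - alpha) * prev
        y[t] = prev
    return y

if __name__ == "__main__":
    # Toy input: 4 positions, 3-dimensional embeddings.
    embeddings = np.arange(12, dtype=float).reshape(4, 3)
    print(ema_embeddings(embeddings, alpha=0.5))
```

Each output position mixes the current embedding with a decaying summary of everything before it, so information from earlier positions reaches later ones without any pairwise attention.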
