Homepage
Close
Menu

Site Navigation

  • Home
  • Archive(TODO)
    • By Day
    • By Month
  • About(TODO)
  • Stats
Close
by evolvingstuff on 2019-11-27 (UTC).

Single Headed Attention RNN: Stop Thinking With Your Head

"The final results are achievable in plus or minus 24 hours on a single GPU as the author is impatient."
"Take that Sesame Street."

paper: https://t.co/6vs5U0p71W
code: https://t.co/uWRPylK1tj pic.twitter.com/dT6v2DounV

— Thomas Lahore (@evolvingstuff) November 27, 2019
research
by Smerity on 2019-11-27 (UTC).

Introducing the SHA-RNN :)
- Read alternative history as a research genre
- Learn of the terrifying tokenization attack that leaves language models perplexed
- Get near SotA results on enwik8 in hours on a lone GPU
No Sesame Street or Transformers allowed.https://t.co/oCArjFKVDK pic.twitter.com/RN5TPZ3xWH

— Smerity (@Smerity) November 27, 2019
research
by slashML on 2019-11-27 (UTC).

"Single Headed Attention RNN: Stop Thinking With Your Head": Take that Sesame Street! https://t.co/G92vAoSkbp

— /MachineLearning (@slashML) November 27, 2019
nlpresearch

Tags

learning tutorial misc nlp rstats gan ethics research dataviz survey python tool security kaggle video thought bayesian humour tensorflow w_code bias dataset pytorch cv tip application javascript forecast swift golang rl jax julia gnn causal surey diffusion
© Copyright Philosophy 2018 Site Template by Colorlib