Tweeted By @KaiLashArul
Temporal Value Transport - new heuristic for dealing with long-term credit assignment in RL with memory-augmented NNs: https://t.co/vPImOO3DPJ . Work like this and RUDDER are trying to address a fundamental problem with RL. pic.twitter.com/nVY7SRPDXn
— Kaixhin (@KaiLashArul) October 17, 2018