Tweeted By @svlevine

on 2019-06-13 (UTC)
research w_code

Can we use reinforcement learning together with search to solve temporally extended tasks? In Search on the Replay Buffer (w/ Ben Eysenbach and @rsalakhu), we use goal-conditioned policies to build a graph for search.

Paper: https://t.co/qMEHyU06mU
Colab: https://t.co/jqADqWeJEu
— Sergey Levine (@svlevine) June 13, 2019

Tweeted By @svlevine

Tags