Tweeted By @svlevine
Offline model-based RL for goal reaching: learn a distance "Q-like" function from offline data, and a video prediction model, then use them to accomplish visually indicated goals.
— Sergey Levine (@svlevine) January 10, 2021
w/ Stephen Tian et al.https://t.co/pmXL8fGHXvhttps://t.co/x9XXI7PN06
🧵> pic.twitter.com/G3t23nBWXo