Tweeted By @svlevine

on 2021-01-10 (UTC)
rl research

Offline model-based RL for goal reaching: learn a distance "Q-like" function from offline data, and a video prediction model, then use them to accomplish visually indicated goals.

w/ Stephen Tian et al.https://t.co/pmXL8fGHXv https://t.co/x9XXI7PN06

🧵> pic.twitter.com/G3t23nBWXo
— Sergey Levine (@svlevine) January 10, 2021

Tweeted By @svlevine

Tags