Tweeted By @dennybritz

on 2019-01-25 (UTC)
research

a group with 3 other tweets.

Why doesn’t RL use more toy tasks to measure advances in specific aspects of a problem like long term planning, large action spaces, imperfect information, etc?

Complex environments such as Starcraft are impressive but make it difficult to disentangle *why* an agent wins.
— Denny Britz (@dennybritz) January 25, 2019

Tags