Tweeted by @dennybritz
Why doesn’t RL use more toy tasks to measure advances in specific aspects of a problem, like long-term planning, large action spaces, imperfect information, etc.?
— Denny Britz (@dennybritz) January 25, 2019
Complex environments such as StarCraft are impressive, but they make it difficult to disentangle *why* an agent wins.