Tweeted By @tdietterich
"this" meaning sampling and simulating unlimited numbers of steps. My experience applying RL to wildfire management suggests that the number of required simulations is infeasibly large even for the massive farms Google is using; hence the claim is meaningless.
— Thomas G. Dietterich (@tdietterich) August 7, 2018