Tweeted By @gdb
An agent which learned to play Mario without rewards. Instead, it was incentivized to avoid "boredom" (that is, getting into states where it can predict what will happen next). Discovered warp levels, how to defeat bosses, etc. More details: https://t.co/lGw3rZUbv3 pic.twitter.com/6ObS35iZZS
— Greg Brockman (@gdb) October 31, 2018
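
The tweet does not say how "boredom" is computed. As a rough illustration only, one common way to express the idea is to pay the agent an intrinsic reward equal to the error of a learned model that predicts the next state: transitions the model already predicts well earn almost nothing, while surprising ones earn more. The sketch below assumes a tiny linear forward model over one-hot states and actions; `ForwardModel`, `intrinsic_reward`, and `one_hot` are hypothetical names made up for this example and are not taken from the linked work.

```python
class ForwardModel:
    """Hypothetical linear model predicting the next state from (state, action)."""

    def __init__(self, state_dim, action_dim, lr=0.1):
        # One weight row per next-state dimension, initialised to zero.
        self.weights = [[0.0] * (state_dim + action_dim) for _ in range(state_dim)]
        self.lr = lr

    def intrinsic_reward(self, state, action, next_state):
        """Return the mean squared prediction error, then train on the transition."""
        x = list(state) + list(action)
        predicted = [sum(w * xi for w, xi in zip(row, x)) for row in self.weights]
        errors = [target - p for target, p in zip(next_state, predicted)]
        reward = sum(e * e for e in errors) / len(errors)
        # One gradient step on the squared error, so a repeated transition
        # becomes easier to predict and therefore less rewarding ("boring").
        for row, e in zip(self.weights, errors):
            for j, xj in enumerate(x):
                row[j] += self.lr * e * xj
        return reward


def one_hot(index, size):
    """Encode an integer index as a one-hot list of floats."""
    return [1.0 if i == index else 0.0 for i in range(size)]


model = ForwardModel(state_dim=4, action_dim=2)

# Repeat the same transition 20 times: the reward should shrink each time.
s, a, s_next = one_hot(0, 4), one_hot(1, 2), one_hot(1, 4)
familiar = [model.intrinsic_reward(s, a, s_next) for _ in range(20)]

# A transition the model has never seen is still surprising.
novel = model.intrinsic_reward(one_hot(2, 4), one_hot(0, 2), one_hot(3, 4))

print("first visit:", familiar[0])
print("20th visit: ", familiar[-1])
print("novel:      ", novel)
```

In this toy setup the reward for the repeated transition decays toward zero while the unseen transition keeps a high reward, which is the sense in which a reward-free agent is pushed away from predictable states and toward unexplored ones. A real agent playing from pixels would need a learned feature space and a policy trained on this signal, neither of which is shown here.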