Tweeted By @honnibal
Sure thing! You might skim this: https://t.co/nFI8DsHAIK . Now I just have a NN predicting the actions, instead of an averaged perceptron.
— Matthew Honnibal (@honnibal) September 14, 2018
It's sort of like reinforcement learning. To predict the next state, we need the current state to make the features.