Tweeted By @DeepMindAI
Bayesian Action Decoder (https://t.co/tpnSgBPPA3): A new multi-agent RL method for learning to communicate via informative actions using ToM-like reasoning. Achieves the best known score for 2 players on the challenging #hanabigame
— DeepMind (@DeepMindAI) November 6, 2018