Tweeted By @ylecun
One Learning to RL them all:
— Yann LeCun (@ylecun) December 4, 2020
ReBeL (Recursive Belief-based Learning) is a general RL+Search method that works for all two-player zero-sum games, including imperfect-information games (poker, liar's dice, ...) and perfect-information games (chess, Go, ...). https://t.co/2sw8Zbe8rg