Tweeted By @ylecun

on 2020-12-04 (UTC)
research tool rl

One Learning to RL them all:
ReBeL (Recursive Belief-based Learning) is a general RL+Search method that works for all two-player zero-sum games, including imperfect-information games (poker, liar's dice,...) and perfect-information games (chess, go....). https://t.co/2sw8Zbe8rg
— Yann LeCun (@ylecun) December 4, 2020

Tweeted By @ylecun

Tags