Tweeted By @jackclarkSF
Some megascale RL results from @OpenAI:
— Jack Clark (@jackclarkSF) June 25, 2018
We've scaled existing methods to train AIs with sufficient teamwork skills to solve hard problems within Dota 2
- Scaled-up PPO+LSTM
- ~120,000 CPUs + 256 GPUs
- Self-play
- Hyperparameter called "Team Spirit" to teach AIs to collaborate https://t.co/lcSGWw0yr5
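
A rough sketch of what a "Team Spirit" style hyperparameter could look like, for readers who want something concrete. The tweet gives no formula, so this assumes the setting is a weight between 0 and 1 that blends each agent's own reward with the team's average reward. The function name `blend_rewards` and the example numbers are hypothetical and are not taken from OpenAI's code.

```python
from typing import List


def blend_rewards(individual: List[float], team_spirit: float) -> List[float]:
    """Blend each agent's own reward with the team mean.

    Assumption: team_spirit is a weight in [0, 1]. At 0 every agent keeps
    only its own reward; at 1 every agent receives the team average.
    """
    if not 0.0 <= team_spirit <= 1.0:
        raise ValueError("team_spirit must be between 0 and 1")
    if not individual:
        raise ValueError("need at least one agent reward")

    team_mean = sum(individual) / len(individual)
    return [
        (1.0 - team_spirit) * reward + team_spirit * team_mean
        for reward in individual
    ]


# Hypothetical per-agent rewards for a five-agent team.
rewards = [1.0, 0.0, 0.0, 0.0, 4.0]
print(blend_rewards(rewards, 0.0))  # purely selfish rewards
print(blend_rewards(rewards, 0.5))  # halfway between own reward and team mean
print(blend_rewards(rewards, 1.0))  # fully shared rewards
```

With a low weight each agent is rewarded mostly for its own outcomes, and with a high weight the agents are pushed toward whatever helps the team as a whole.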