2019-05 Open-ended Learning in Symmetric Zero-sum Games
https://arxiv.org/abs/1901.08106
PSRO
,
Game Theory
,
PBT
,
Self-Play
,
Open-ended Learning
,
co-evolution
,
RPS
,
DeepMind
,
David Balduzzi
,
Marta Garnelo
,
Yoram Bachrach
,
Wojciech M. Czarnecki
,
Julien Perolat
,
Max Jaderberg
,
Thore Graepel
,
2019