Policy Optimization by Genetic Distillation
https://arxiv.org/abs/1711.01012
EA RL, GPO, policy distillation, EA, GA, RL