2019-04 Evolving Rewards to Automate Reinforcement Learning
https://arxiv.org/abs/1905.07628
RL
,
EA
,
reward shaping
,
hyperparameter
,
Google
,
Aleksandra Faust
,
Dar Mehta
,
2019