MPO
Example: V-MPO
Duality — A New Approach to Reinforcement Learning
2020-05 [MO-VMPO] A Distributional View on Multi-Objective Policy Optimization
2019-10 [VMPO] V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control
2018-06 [MPO] Maximum a Posteriori Policy Optimisation
Example: MO-V-MPO