NAPPO: Modular and scalable reinforcement learning in pytorch

https://arxiv.org/abs/2007.02622
https://youtu.be/L442rrVnDr4

NAPPO, PPO, distributed computing, obstacle tower, Unity3D, ml agent