PPO Dash: Improving Generalization in Deep Reinforcement Learning