QMix
QTRAN
QTRAN: Learning to Factorize with Transformation for Cooperative Multi-Agent Reinforcement learning
https://github.com/oxwhirl/pymarl
Off-policy 데이터는 불안정함
Towards Understanding Linear Value Decomposition in Cooperative Multi-Agent Q-Learning
Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning
SMIX(λ): Enhancing Centralized Value Functions for Cooperative Multi-Agent Reinforcement Learning