====== QMix ====== ===== QTRAN ===== * [[https://arxiv.org/pdf/1905.05408.pdf|QTRAN: Learning to Factorize with Transformation for Cooperative Multi-Agent Reinforcement learning]] * https://github.com/oxwhirl/pymarl * Off-policy 데이터는 불안정함 * [[https://arxiv.org/pdf/2006.00587.pdf|Towards Understanding Linear Value Decomposition in Cooperative Multi-Agent Q-Learning]] * [[https://arxiv.org/pdf/1702.08887.pdf|Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning]] * [[https://www.aaai.org/Papers/AAAI/2020GB/AAAI-WenC.2318.pdf|SMIX(λ): Enhancing Centralized Value Functions for Cooperative Multi-Agent Reinforcement Learning]]