drl:start

차이

문서의 선택한 두 판 사이의 차이를 보여줍니다.

차이 보기로 링크

--- drl:start [2017/04/27 04:05] – [시뮬레이션] rex8312
+++ drl:start [2024/03/23 02:42] (현재) – 바깥 편집 127.0.0.1
@@ 줄 1: / 줄 1: @@
-===== General Video Game Playing 관련연구 목록 =====
+===== General Video Game Playing =====
+==== Learning Algorithm ====
+  * [[https://arxiv.org/abs/1704.04651|The Reactor: A Sample-Efficient Actor-Critic Architecture]]
+==== Reward Shaping ====
+  * [[https://arxiv.org/abs/1611.05397|Reinforcement Learning with Unsupervised Auxiliary Tasks]]
 ==== Simulation ====
@@ 줄 28: / 줄 36: @@
   * Progressive Network
     * A. A. Rusu et al., “Progressive neural networks,” arXiv preprint arXiv:1606.04671, 2016.
+  * [[https://arxiv.org/pdf/1310.8499.pdf|2014-05, Deep AutoRegressive Networks]]
@@ 줄 51: / 줄 60: @@
 ^Dueling Network | Z. Wang, N. de Freitas, and M. Lanctot, “Dueling network architectures for deep reinforcement learning,” arXiv preprint arXiv:1511.06581, 2015.|
-==== Memory ====
-^Neural Turing Machine  |A. Graves, G. Wayne, and I. Danihelka, “Neural Turing Machines,” arXiv:1410.5401 [cs], Oct. 2014.|
-^External Memory |A. Graves et al., “Hybrid computing using a neural network with dynamic external memory,” Nature, vol. 538, no. 7626, pp. 471–476, Oct. 2016. |
-==== Reward Shaping ====
-^[[DRDL:UNREAL]] |[1]M. Jaderberg et al., “Reinforcement Learning with Unsupervised Auxiliary Tasks,” arXiv:1611.05397 [cs], Nov. 2016. |
 ==== AI Platform ====
@@ 줄 72: / 줄 76: @@
 ^Anold |G. Lample and D. S. Chaplot, “Playing FPS Games with Deep Reinforcement Learning,” arXiv:1609.05521 [cs], Sep. 2016. |
+===== 구현체 =====
-{{tag>DRL deep-learning reinforcement-learning}}
+  * https://github.com/google/dopamine

drl/start.1493265908.txt.gz · 마지막으로 수정됨: 2024/03/23 02:38 (바깥 편집)