• 내용으로 건너뛰기

Out of the Box

사용자 도구

  • 로그인

사이트 도구

  • 최근 바뀜
  • 미디어 관리자
  • 사이트맵
추적: • 2020-10_assessing_game_balance_alphazero_exploring_alternative_rule_sets_chess • 2019-12_covariance_matrix_adaptation_for_the_rapid_illumination_of_behavior_space • 2024-10-31_project_sid_many-agent_simulations_toward_ai_civilization • 2024-01_stablelm-2-1.6b • 2016-08_popart_learning_values_across_many_orders_of_magnitude • 2021-07_epistemic_neural_networks • 2023-12_batched_low-rank_adaptation_of_foundation_models • 2023-10_mistral_7b • 2023-12_unicron_economizing_self-healing_llm_training_at_scale • start

tag:2018

TAG: 2018

  • 2017-06 Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour
2021/12/21 15:25Hyunsoo Park
  • 2018-03 On First-Order Meta-Learning Algorithms
2021/07/20 07:57Hyunsoo Park
  • 2018-03 World Models
2020/07/30 15:30Hyunsoo Park
  • 2018-06 [MPO] Maximum a Posteriori Policy Optimisation
2021/08/24 14:02Hyunsoo Park
  • 2018-07 Human-level performance in first-person multiplayer games with population-based deep reinforcement learning
2021/09/30 16:23Hyunsoo Park
  • 2018-10 Exploration by Random Network Distillation
2021/03/25 22:09Hyunsoo Park
  • 2018-11 QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation
2021/07/21 22:27Hyunsoo Park
  • [GPT] Improving Language Understanding by Generative Pre-Training
2020/07/21 16:46Hyunsoo Park
  • DeepMimic: Example-Guided Deep Reinforcement Learning of Physics-Based Character Skills
2020/07/22 14:58Hyunsoo Park
  • Revisiting Small Batch Training for Deep Neural Networks
2020/07/23 14:13Hyunsoo Park

문서 도구

  • 문서 보기
  • 이전 판
  • 역링크
  • Fold/unfold all
  • 맨 위로
별도로 명시하지 않을 경우, 이 위키의 내용은 다음 라이선스에 따라 사용할 수 있습니다: CC Attribution-Noncommercial-Share Alike 4.0 International
CC Attribution-Noncommercial-Share Alike 4.0 International Donate Powered by PHP Valid HTML5 Valid CSS Driven by DokuWiki