Out of the Box

tag:llm

Backlinks

The following pages link to the current page.

  • review:2023-01_gpt_in_60_lines_of_numpy
  • review:2023-03_a_survey_of_large_language_models
  • review:2023-03_chatgpt4pcg_competition_character-like_level_generation_for_science_birds
  • review:2023-04_generative_agents_interactive_simulacra_of_human_behavior
  • review:2023-05_improving_language_model_negotiation_with_self-play_and_in-context_learning_from_ai_feedback
  • review:2023-06_a_technical_report_for_polyglot-ko_open-source_large-scale_korean_language_models
  • review:2023-06_secrets_of_rlhf_in_large_language_models_part_i_ppo
  • review:2023-07_polylm_an_open_source_polyglot_large_language_model
  • review:2023-08_jiang_chinese_open_foundation_language_model
  • review:2023-10_a_general_theoretical_paradigm_to_understand_learning_from_human_preferences
  • review:2023-10_large_language_models_as_generalizable_policies_for_embodied_tasks
  • review:2023-10_mistral_7b
  • review:2023-10_vanishing_gradients_in_reinforcement_finetuning_of_language_models
  • review:2023-12_batched_low-rank_adaptation_of_foundation_models
  • review:2023-12_diloco_distributed_low-communication_training_of_language_models
  • review:2023-12_direct_preference_optimization_your_language_model_is_secretly_a_reward_model
  • review:2023-12_efficient_large_language_models_a_survey
  • review:2023-12_llm-powered_hierarchical_language_agent_for_real-time_human-ai_coordination
  • review:2023-12_speeding_up_the_gpt_-_kv_cache
  • review:2023-12_unicron_economizing_self-healing_llm_training_at_scale
  • review:2024-01_agent_alignment_in_evolving_social_norms
  • review:2024-01_args_alignment_as_reward-guided_search
  • review:2024-01_asynchronous_local-sgd_training_for_language_modeling
  • review:2024-01_a_minimaximalist_approach_to_reinforcement_learning_from_human_feedback
  • review:2024-01_biofinbert_finetuning_large_language_models_llms_to_analyze_sentiment_of_press_releases_and_financial_text_around_inflection_points_of_biotech_stocks
  • review:2024-01_code_generation_with_alphacodium_from_prompt_engineering_to_flow_engineering
  • review:2024-01_deepseekmoe_towards_ultimate_expert_specialization_in_mixture-of-experts_language_models
  • review:2024-01_efficient_tool_use_with_chain-of-abstraction_reasoning
  • review:2024-01_in-context_learning_with_retrieved_demonstrations_for_language_models_a_survey
  • review:2024-01_large_language_models_for_robotics_opportunities_challenges_and_perspectives
  • review:2024-01_large_language_model_based_multi-agents_a_survey_of_progress_and_challenges
  • review:2024-01_llm_maybe_longlm_self-extend_llm_context_window_without_tuning
  • review:2024-01_medusa_simple_llm_inference_acceleration_framework_with_multiple_decoding_heads
  • review:2024-01_metacognition_is_all_you_need_using_introspection_in_generative_agents_to_improve_goal-directed_behavior
  • review:2024-01_mixtral_of_experts
  • review:2024-01_mm-llms_recent_advances_in_multimodal_large_language_models
  • review:2024-01_monte_carlo_tree_search_for_recipe_generation_using_gpt-2
  • review:2024-01_rag_vs_fine-tuning_pipelines_tradeoffs_and_a_case_study_on_agriculture
  • review:2024-01_reft_reasoning_with_reinforced_fine-tuning
  • review:2024-01_secrets_of_rlhf_in_large_language_models_part_ii_reward_modeling
  • review:2024-01_self-rewarding_language_models
  • review:2024-01_speechagents_human-communication_simulation_with_multi-modal_multi-agent_systems
  • review:2024-01_tinyllama_an_open-source_small_language_model
  • review:2024-01_towards_conversational_diagnostic_ai
  • review:2024-01_warm_on_the_benefits_of_weight_averaged_reward_models
  • review:2024-02_large_language_model_for_table_processing_a_survey
  • review:2024-02_puzzle_solving_using_reasoning_of_large_language_models_a_survey
  • review:2024-02_s-agents_self-organizing_agents_in_open-ended_environments
  • review:2024-02_the_era_of_1-bit_llms_all_large_language_models_are_in_1.58_bits
  • review:2024-02_tinyllm_learning_a_small_student_from_multiple_large_language_models
  • review:2024-02_weblinx_real-world_website_navigation_with_multi-turn_dialogue
  • review:2024-03_dipaco_distributed_path_composition
  • review:2024-03_evaluate_llms_in_real_time_with_street_fighter_iii
  • review:2024-03_explorllm_guiding_exploration_in_reinforcement_learning_with_large_language_models
  • review:2024-03_galore_memory-efficient_llm_training_by_gradient_low-rank_projection
  • review:2024-03_gemma_open_models_based_on_gemini_research_and_technology
  • review:2024-03_parameter-efficient_fine-tuning_for_large_models_a_comprehensive_survey
  • review:2024-04_a_survey_on_efficient_inference_for_large_language_models
  • review:2024-04_a_survey_on_integration_of_large_language_models_with_intelligent_robots
  • review:2024-04_a_survey_on_self-evolution_of_large_language_models
  • review:2024-04_a_survey_on_the_memory_mechanism_of_large_language_model_based_agents
  • review:2024-04_megalodon_efficient_llm_pretraining_and_inference_with_unlimited_context_length
  • review:2024-04_openelm_an_efficient_language_model_family_with_open-source_training_and_inference_framework
  • review:2024-04_player-driven_emergence_in_llm-driven_game_narrative
  • review:2024-04_pre-training_small_base_lms_with_fewer_tokens
  • review:2024-04_toward_self-improvement_of_llms_via_imagination_searching_and_criticizing
  • review:2024-11_transformers_are_multi-state_rnns
  • review:2025-02_llm_post-training_a_deep_dive_into_reasoning_large_language_models
  • review:2025-05-05_voila_voice-language_foundation_models_for_real-time_autonomous_interaction_and_voice_role-play

Except where otherwise noted, content on this wiki is licensed under the following license: CC Attribution-Noncommercial-Share Alike 4.0 International