2024-01 Asynchronous Local-SGD Training for Language Modeling
https://arxiv.org/abs/2401.09135
DiLoCo
,
LLM학습
,
LLM
,
분산학습
,
연합학습
,
Google
,
DeepMind
,
2024