2021-01 ZeRO-Offload: Democratizing Billion-Scale Model Training
https://arxiv.org/abs/2101.06840
https://www.deepspeed.ai/
https://github.com/microsoft/DeepSpeed
zero-offload
,
zero
,
deepspeed
,
offloading
,
memory efficient
,
Microsoft
,
Jie Ren
,
Yuxiong He
,
2021