SGDR: Stochastic Gradient Descent with Warm Restarts
https://arxiv.org/abs/1608.03983
SGDR
,
2017
,
SGD
,
optimizer
,
learning rate
,
HPO
,
warm restart