How AI Training Scales / An Empirical Model of Large-Batch Training