Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift March 02, 2015 https://arxiv.org/pdf/1502.03167 Fullscreen Dark Mode