Batch normalization plays a crucial role when training deep neural networks.
Supplementary notes can be added here, including code and math.