Demonstrate how to do backpropagation using an example of BatchNorm-Sigmoid-MSELoss network with a detailed derivation of gradients and custom implementations.
Please see the details at https://coolgpu.github.io/coolgpu_blog/github/pages/2020/09/14/backpropagation.html.