Interactive KL Divergence Visualisation
An interactive visualisation of Kullback–Leibler divergence. Shape two distributions and watch forward vs reverse KL, the pointwise integrand, and the effects of asymmetry, support mismatch and discretisation.
Deriving the gradients for the backward pass for matrix multiplication using tensor calculus
How does tensor parallelism work?
Deriving the gradient for the backward pass using tensor calculus and index notation
Obtaining the gradient of the matrix inverse
Obtaining the gradient of the cross-entropy loss (softmax and negative log-likelihood loss function)
Deriving the gradient for the backward pass for the linear layer using tensor calculus
Obtaining the gradient of the layer normalization layer
A quick intro on backpropagation and multivariable calculus for deep learning