backpropagation 8
- Einsum, Deriving the Gradient for the Backward Pass
- Matrix Inverse, Deriving the Gradient for the Backward Pass
- Cross-Entropy Loss (Softmax) Gradient Used In Deep Learning
- Gradients of Matrix Multiplication in Deep Learning
- Linear Layer, Deriving the Gradient for the Backward Pass
- Layer Normalization, Deriving the Gradient for the Backward Pass
- The Tensor Calculus You Need for Deep Learning
- Backpropagation and Multivariable Calculus