Tags ai9 backpropagation8 deep learning12 entropy3 gradients1 index notation8 information theory3 interactive3 jensen-shannon1 jensen-shannon divergence1 kl divergence3 kullback-leibler3 language models1 llm1 logprobs1 maths12 mutual information1 parallelism1 perplexity1 probability3 rank correlation1 statistics3 tensor calculus9 visualisation3 zipf1 zipfs law1