Rylan Schaeffer
Resume
Research
Learning
Blog
Teaching
Jokes
Kernel Papers
Deep Learning
Gradients in Math and Code
Normalization in Deep Learning
Layer Normalization
Root Mean Square Layer Normalization
Positional Encodings in Deep Learning
Optimization in Deep Learning