Search for a command to run...
Transformers without Tears: Improving the Normalization of Self-Attention