Spatial Token Mixer
Spatial Token Mixer (STM) is a module for vision transformers that aims to make token mixing more efficient. It applies depthwise convolution over the spatial dimensions of the tokens, so it can serve as a plug-and-play replacement for the token-mixing layer in a vision transformer, with the goal of improving both model performance and computational efficiency.
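To make the idea of depthwise convolution over token positions concrete, here is a minimal sketch in plain Python. It is not the STM implementation: the function name `depthwise_token_mix`, the row-major token layout, the zero padding, and the stride of 1 are all assumptions chosen for illustration. Each channel gets its own k x k filter, and channels are never mixed with each other.

```python
def depthwise_token_mix(tokens, height, width, kernels):
    """Mix tokens spatially with one k x k filter per channel.

    Hypothetical helper, not the STM reference code.
    tokens:  list of height*width tokens in row-major order,
             each a list of C channel values.
    kernels: list of C filters, each a k x k nested list (k odd).
    Assumes zero padding and stride 1, so the output keeps the input shape.
    """
    if len(tokens) != height * width:
        raise ValueError("token count must equal height * width")
    num_channels = len(kernels)
    k = len(kernels[0])
    pad = k // 2

    mixed = [[0.0] * num_channels for _ in range(height * width)]
    for row in range(height):
        for col in range(width):
            out = mixed[row * width + col]
            for dy in range(k):
                for dx in range(k):
                    src_row = row + dy - pad
                    src_col = col + dx - pad
                    # Positions outside the grid contribute zero (zero padding).
                    if 0 <= src_row < height and 0 <= src_col < width:
                        src = tokens[src_row * width + src_col]
                        # Depthwise: channel c only ever reads channel c.
                        for c in range(num_channels):
                            out[c] += kernels[c][dy][dx] * src[c]
    return mixed
```

Because each channel uses a single k x k filter, the mixing step has k * k * C weights and its cost grows linearly with the number of tokens, whereas self-attention grows quadratically with the token count. In a real model this loop would normally be a grouped convolution layer from a deep learning framework, with the token sequence reshaped to a height by width grid first.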