Spatial Token Mixer

Spatial Token Mixer (STM) is a module designed for vision transformers to make token mixing more efficient. It applies depthwise convolution over the spatial dimensions of the tokens, so it can serve as a plug-and-play replacement for the token-mixing layer in a vision transformer, with the aim of improving both model performance and computational efficiency.
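To make the idea concrete, the following is a minimal NumPy sketch of depthwise-convolution token mixing. It is not the reference STM implementation. The function names (`depthwise_conv2d`, `spatial_token_mixer`), the zero "same" padding, the odd square kernel, and the absence of a class token are all assumptions made for illustration. As in most deep learning frameworks, the "convolution" here is computed as a cross-correlation.

```python
import numpy as np


def depthwise_conv2d(x, kernels):
    """Depthwise 2D convolution (cross-correlation) with zero 'same' padding.

    x:       (C, H, W) feature map
    kernels: (C, k, k) one filter per channel, k odd (assumed)
    """
    C, H, W = x.shape
    k = kernels.shape[-1]
    pad = k // 2
    # Pad only the two spatial axes, never the channel axis.
    xp = np.pad(x, ((0, 0), (pad, pad), (pad, pad)))
    out = np.zeros_like(x, dtype=float)
    for dy in range(k):
        for dx in range(k):
            # Each channel is scaled by its own kernel entry,
            # so information moves across positions but not across channels.
            out += kernels[:, dy, dx, None, None] * xp[:, dy:dy + H, dx:dx + W]
    return out


def spatial_token_mixer(tokens, grid_hw, kernels):
    """Mix tokens spatially with a depthwise convolution (hypothetical sketch).

    tokens:  (N, C) patch tokens in row-major order, N = H * W, no class token
    grid_hw: (H, W) shape of the patch grid
    kernels: (C, k, k) depthwise filters
    """
    H, W = grid_hw
    N, C = tokens.shape
    assert N == H * W, "token count must match the patch grid"
    # Restore the 2D layout so that neighbouring patches are adjacent.
    fmap = tokens.T.reshape(C, H, W)
    mixed = depthwise_conv2d(fmap, kernels)
    # Flatten back to a token sequence of the original shape.
    return mixed.reshape(C, N).T
```

Because the output has the same `(N, C)` shape as the input, a function like this could stand in wherever a token-mixing layer is expected. With a k x k kernel the cost grows linearly in the number of tokens (roughly N * C * k^2 multiply-adds), whereas self-attention grows quadratically in N.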
