HyperAI

Efficient Vits On Imagenet 1K With Lv Vit S

Metrics

GFLOPs
Top 1 Accuracy

Results

Performance results of various models on this benchmark

Model Name
GFLOPs
Top 1 Accuracy
Paper TitleRepository
MCTF ($r=16$)3.682.3Multi-criteria Token Fusion with One-step-ahead Attention for Efficient Vision Transformers
MCTF ($r=8$)4.983.5Multi-criteria Token Fusion with One-step-ahead Attention for Efficient Vision Transformers
DynamicViT (70%)4.683.0DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification
Base (LV-ViT-S)6.683.3All Tokens Matter: Token Labeling for Training Better Vision Transformers
eTPS3.882.5Joint Token Pruning and Squeezing Towards More Aggressive Compression of Vision Transformers
SPViT4.383.1SPViT: Enabling Faster Vision Transformers via Soft Token Pruning
DynamicViT (80%)5.183.2DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification
EViT (50%)3.982.5Not All Patches are What You Need: Expediting Vision Transformers via Token Reorganizations
DPS-LV-ViT-S4.582.9Patch Slimming for Efficient Vision Transformers-
PS-LV-ViT-S4.782.4Patch Slimming for Efficient Vision Transformers-
DiffRate3.982.6DiffRate : Differentiable Compression Rate for Efficient Vision Transformers
MCTF ($r=12$)4.283.4Multi-criteria Token Fusion with One-step-ahead Attention for Efficient Vision Transformers
AS-LV-S (60%)3.982.6Adaptive Sparse ViT: Towards Learnable Adaptive Token Pruning by Fully Exploiting Self-Attention-
EViT (70%)4.783.0Not All Patches are What You Need: Expediting Vision Transformers via Token Reorganizations
AS-LV-S (70%)4.683.1Adaptive Sparse ViT: Towards Learnable Adaptive Token Pruning by Fully Exploiting Self-Attention-
dTPS3.882.6Joint Token Pruning and Squeezing Towards More Aggressive Compression of Vision Transformers
BAT4.783.1Beyond Attentive Tokens: Incorporating Token Importance and Diversity for Efficient Vision Transformers
PPT4.683.1PPT: Token Pruning and Pooling for Efficient Vision Transformers
DynamicViT (90%)5.883.3DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification
0 of 19 row(s) selected.