Zero Shot Transfer Image Classification On 6
评估指标
Accuracy (Private)
评测结果
各个模型在此基准测试上的表现结果
模型名称 | Accuracy (Private) | Paper Title | Repository |
---|---|---|---|
InternVL-C | 80.6 | InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks | - |
LiT-tuning | 81.1 | LiT: Zero-Shot Transfer with Locked-image text Tuning | - |
PaLI | 42.62 | PaLI: A Jointly-Scaled Multilingual Language-Image Model | - |
EVA-CLIP-E/14+ | 79.6 | EVA-CLIP: Improved Training Techniques for CLIP at Scale | - |
LiT-22B | 87.6 | Scaling Vision Transformers to 22 Billion Parameters | - |
EVA-CLIP-18B | 82.2 | EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters | - |
CLIP | 72.3 | Learning Transferable Visual Models From Natural Language Supervision | - |
LiT ViT-e | 84.9 | PaLI: A Jointly-Scaled Multilingual Language-Image Model | - |
CoCa | 82.7 | CoCa: Contrastive Captioners are Image-Text Foundation Models | - |
0 of 9 row(s) selected.