Zero Shot Transfer Image Classification On 6
평가 지표
Accuracy (Private)
평가 결과
이 벤치마크에서 각 모델의 성능 결과
모델 이름 | Accuracy (Private) | Paper Title | Repository |
---|---|---|---|
InternVL-C | 80.6 | InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks | |
LiT-tuning | 81.1 | LiT: Zero-Shot Transfer with Locked-image text Tuning | |
PaLI | 42.62 | PaLI: A Jointly-Scaled Multilingual Language-Image Model | |
EVA-CLIP-E/14+ | 79.6 | EVA-CLIP: Improved Training Techniques for CLIP at Scale | |
LiT-22B | 87.6 | Scaling Vision Transformers to 22 Billion Parameters | |
EVA-CLIP-18B | 82.2 | EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters | - |
CLIP | 72.3 | Learning Transferable Visual Models From Natural Language Supervision | |
LiT ViT-e | 84.9 | PaLI: A Jointly-Scaled Multilingual Language-Image Model | |
CoCa | 82.7 | CoCa: Contrastive Captioners are Image-Text Foundation Models |
0 of 9 row(s) selected.