Multi Lingual Image Text Classification On
Metrics
Top 1 Accuracy %
Top 5 Accuracy %
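This page does not define the two metrics, so the sketch below assumes the usual convention: a prediction counts as correct under top-k accuracy when the true class is among the k highest-scored classes. The function name `top_k_accuracy` and the toy scores are hypothetical and do not come from the benchmark or the GLAMI-1M paper.

```python
def top_k_accuracy(scores, labels, k):
    """Return the percentage of samples whose true label is among
    the k classes with the highest scores.

    scores: list of per-sample lists, one score per class
    labels: list of true class indices, one per sample
    """
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if not labels:
        raise ValueError("at least one sample is required")

    hits = 0
    for row, label in zip(scores, labels):
        # Class indices ordered from highest to lowest score.
        ranked = sorted(range(len(row)), key=lambda c: row[c], reverse=True)
        if label in ranked[:k]:
            hits += 1
    return 100.0 * hits / len(labels)


# Toy example (made-up scores): three samples, three classes.
toy_scores = [
    [0.1, 0.7, 0.2],
    [0.5, 0.3, 0.2],
    [0.2, 0.3, 0.5],
]
toy_labels = [1, 1, 0]

top1 = top_k_accuracy(toy_scores, toy_labels, k=1)
top2 = top_k_accuracy(toy_scores, toy_labels, k=2)
```

Top-5 accuracy is the same computation with `k=5`, so it can never be lower than top-1 accuracy on the same predictions.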
Results
Performance results of various models on this benchmark
Model Name | Top 1 Accuracy % | Top 5 Accuracy % | Paper Title | Repository |
---|---|---|---|---|
EmbraceNet (image+text) | 69.7 | 94.0 | GLAMI-1M: A Multilingual Image-Text Fashion Dataset | |
CLIP (zero-shot image+text) | 32.3 | 74.5 | GLAMI-1M: A Multilingual Image-Text Fashion Dataset | |