HyperAI

Image Captioning On Nocaps Out Of Domain

Metriken

CIDEr
SPICE

Ergebnisse

Leistungsergebnisse verschiedener Modelle zu diesem Benchmark

Vergleichstabelle
ModellnameCIDErSPICE
clipcap-clip-prefix-for-image-captioning49.149.57
Modell 221.37.2
clipcap-clip-prefix-for-image-captioning49.359.7
Modell 472.1311.53
Modell 530.098.08
Modell 626.557.72
Modell 758.488.77
Modell 830.098.08
vivo-surpassing-human-performance-in-novel110.1413.74
Modell 1071.4310.57
Modell 1148.738.2
Modell 1288.5413.87
Modell 1370.2110.15
Modell 14103.7513.75
Modell 1585.1811.18
Modell 1668.9210.05
Modell 1787.5112.52
Modell 1877.3911.59
Modell 1923.077.4
Modell 2068.510.01
Modell 2154.569.9
git-a-generative-image-to-text-transformer122.2715.62
vinvl-making-visual-representations-matter-in78.0111.48
simvlm-simple-visual-language-model109.4913.89
Modell 2526.257.52
Modell 2691.6214.21
Modell 2787.1511.43
Modell 28121.6915.13
Modell 2936.129.39
git-a-generative-image-to-text-transformer122.0415.7
Modell 3139.397.62
Modell 3275.3910.68
Modell 3366.679.74
Modell 3443.29.35
Modell 3578.9112.14
Modell 3625.917.61
Modell 3773.759.72
pali-a-jointly-scaled-multilingual-language126.6715.49
Modell 39106.5514.21
grit-faster-and-better-image-captioning72.611.1