HyperAI초신경

Temporal Casual Qa On Next Qa

평가 지표

WUPS

평가 결과

이 벤치마크에서 각 모델의 성능 결과

비교 표
모델 이름WUPS
flamingo-a-visual-language-model-for-few-shot-126.7
pali-x-on-scaling-up-a-multilingual-vision38.3
pali-3-vision-language-models-smaller-faster37.7
retrieving-to-answer-zero-shot-video-question34.7
flamingo-a-visual-language-model-for-few-shot-133.5
generative-pretraining-in-multimodality23.4
gemini-a-family-of-highly-capable-multimodal-129.9
gemini-a-family-of-highly-capable-multimodal-128.0