HyperAI

Fs Mevqa On Sme

Metriken

#Learning Samples (N)
ACC
BLEU-4
CIDEr
Detection
METEOR
ROUGE-L
SPICE

Ergebnisse

Leistungsergebnisse verschiedener Modelle zu diesem Benchmark

Modellname
#Learning Samples (N)
ACC
BLEU-4
CIDEr
Detection
METEOR
ROUGE-L
SPICE
Paper TitleRepository
REX1617.770.000.890.004.3723.230.00REX: Reasoning-aware and Grounded Explanation
VCIN1617.779.174.280.2819.8233.3413.39Variational Causal Inference Network for Explanatory Visual Question Answering
Qwen-VL-Max1640.3324.30201.471.0523.4034.5226.13Qwen-VL: A Versatile Vision-Language Model for Understanding, Localization, Text Reading, and Beyond
MEAgent1651.4567.91510.4429.0950.5579.4164.09Few-Shot Multimodal Explanation for Visual Question Answering
Gemini-1.5 Pro1640.8841.87276.141.4034.6155.9040.58Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
GLM-4V1634.2314.45127.370.8917.5324.2817.70CogVLM: Visual Expert for Pretrained Language Models
GPT-4-1106-Vision-Preview1642.3045.51269.687.0035.1752.6737.67GPT-4 Technical Report
0 of 7 row(s) selected.