HyperAI초신경

홈 뉴스 연구 논문 튜토리얼 데이터셋 백과사전 SOTA LLM 모델 GPU 랭킹 컨퍼런스

한국어

HyperAI초신경

Multimodal Reasoning On Rebus

평가 지표

Accuracy

평가 결과

이 벤치마크에서 각 모델의 성능 결과

모델 이름	Accuracy	Paper Title	Repository
InstructBLIP	0.6	REBUS: A Robust Evaluation Benchmark of Understanding Symbols	-
BLIP2-FLAN-T5-XXL	0.9	REBUS: A Robust Evaluation Benchmark of Understanding Symbols	-
CogVLM	0.9	REBUS: A Robust Evaluation Benchmark of Understanding Symbols	-
LLaVa-1.5-13B	1.8	REBUS: A Robust Evaluation Benchmark of Understanding Symbols	-
Gemini Pro	13.2	REBUS: A Robust Evaluation Benchmark of Understanding Symbols	-
LLaVa-1.5-7B	1.5	REBUS: A Robust Evaluation Benchmark of Understanding Symbols	-
QWEN	0.9	REBUS: A Robust Evaluation Benchmark of Understanding Symbols	-
GPT-4V	24.0	REBUS: A Robust Evaluation Benchmark of Understanding Symbols	-

0 of 8 row(s) selected.