HyperAI

Multimodal Reasoning on REBUS

Evaluation Metrics

Accuracy
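
Accuracy on a benchmark like this is typically the share of puzzles answered correctly, expressed as a percentage. The following is a minimal sketch under that assumption. The function name `accuracy` and the normalization rule (case-insensitive, whitespace-trimmed exact match) are illustrative choices, not taken from the benchmark's own evaluation code.

```python
def accuracy(predictions, references):
    """Return the percentage of predictions that match their reference answer."""
    if len(predictions) != len(references):
        raise ValueError("predictions and references must have the same length")
    if not references:
        raise ValueError("cannot compute accuracy on an empty set")

    def normalize(answer):
        # Assumption: compare case-insensitively and ignore surrounding whitespace.
        return answer.strip().lower()

    correct = sum(
        normalize(p) == normalize(r) for p, r in zip(predictions, references)
    )
    return 100.0 * correct / len(references)


# Hypothetical answers, for illustration only: two of four match.
score = accuracy(
    ["Star Wars", "big ben", "monopoly", "paris"],
    ["star wars", "Big Ben", "Chess", "London"],
)
print(score)  # 50.0
```

A stricter or looser matching rule would change the score, so any comparison between models should use the same rule throughout.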

Evaluation Results

Performance of each model on this benchmark

Comparison Table
Model name                                  Accuracy (%)
rebus-a-robust-evaluation-benchmark-of-1    0.6
rebus-a-robust-evaluation-benchmark-of-1    0.9
rebus-a-robust-evaluation-benchmark-of-1    0.9
rebus-a-robust-evaluation-benchmark-of-1    1.8
rebus-a-robust-evaluation-benchmark-of-1    13.2
rebus-a-robust-evaluation-benchmark-of-1    1.5
rebus-a-robust-evaluation-benchmark-of-1    0.9
rebus-a-robust-evaluation-benchmark-of-1    24.0