Multimodal Reasoning On Algopuzzlevqa

Acc

평가 결과

이 벤치마크에서 각 모델의 성능 결과

		Paper Title
GPT-4	30.3	Are Language Models Puzzle Prodigies? Algorithmic Puzzles Unveil Serious Challenges in Multimodal Reasoning

0 of 1 row(s) selected.