HyperAI

Multimodal Reasoning On Algopuzzlevqa

Metrics

Acc

Results

Performance results of various models on this benchmark

Comparison Table
Model NameAcc
are-language-models-puzzle-prodigies30.3