HyperAI

Common Sense Reasoning On Arc Challenge

المقاييس

Accuracy

النتائج

نتائج أداء النماذج المختلفة على هذا المعيار القياسي

جدول المقارنة
اسم النموذجAccuracy
model-card-and-evaluations-for-claude-models91
large-language-models-can-self-improve88.3
gpt-4-technical-report-196.4
parameter-efficient-sparsity-crafting-from65.2
النموذج 591.03
mixture-of-subspaces-in-low-rank-adaptation81.5
large-language-models-can-self-improve85.2
glam-efficient-scaling-of-language-models50.3
large-language-models-can-self-improve87.1
designing-effective-sparse-expert-models86.5
palm-2-technical-report-159.6
galactica-a-large-language-model-for-science-132.9
galactica-a-large-language-model-for-science-167.9
galactica-a-large-language-model-for-science-131.1
mixlora-enhancing-large-language-models-fine58.1
massive-language-models-can-be-accurately25.6
glam-efficient-scaling-of-language-models48.2
mixlora-enhancing-large-language-models-fine69.9
mixlora-enhancing-large-language-models-fine79.9
palm-2-technical-report-195.1
massive-language-models-can-be-accurately43.94
language-models-are-few-shot-learners51.4
large-language-models-can-self-improve89.8
palm-2-technical-report-164.9
unifying-language-learning-paradigms49.5
llama-open-and-efficient-foundation-language-156.0
النموذج 2791.04
unifying-language-learning-paradigms29.8
massive-language-models-can-be-accurately38.99
palm-2-technical-report-169.2
llama-open-and-efficient-foundation-language-147.6
pythia-a-suite-for-analyzing-large-language36.8
massive-language-models-can-be-accurately41.3
pythia-a-suite-for-analyzing-large-language31.8
massive-language-models-can-be-accurately39.85
large-language-models-can-self-improve87.2
finetuned-language-models-are-zero-shot63.1
llama-open-and-efficient-foundation-language-152.7
model-card-and-evaluations-for-claude-models85.7
designing-effective-sparse-expert-models56.9
llama-open-and-efficient-foundation-language-157.8
bloomberggpt-a-large-language-model-for50.85
model-card-and-evaluations-for-claude-models90
large-language-models-can-self-improve88.7
language-models-are-few-shot-learners53.2
bloomberggpt-a-large-language-model-for48.63
unifying-language-learning-paradigms42.9
bloomberggpt-a-large-language-model-for45.39
finetuned-language-models-are-zero-shot63.8
galactica-a-large-language-model-for-science-151.4
mistral-7b55.5
gpt-4-technical-report-185.2
bloomberggpt-a-large-language-model-for44.54
textbooks-are-all-you-need-ii-phi-1-544.9