HyperAI

Common Sense Reasoning On Commonsenseqa

المقاييس

Accuracy

النتائج

نتائج أداء النماذج المختلفة على هذا المعيار القياسي

جدول المقارنة
اسم النموذجAccuracy
fusing-context-into-knowledge-graph-for83.3
unifiedqa-crossing-format-boundaries-with-a79.1
towards-generalizable-neuro-symbolic-systems73.2
chain-of-thought-prompting-elicits-reasoning28.6
kagnet-knowledge-aware-graph-networks-for58.9
bloomberggpt-a-large-language-model-for66.4
hierarchical-prompting-taxonomy-a-universal92.54
unifiedqa-crossing-format-boundaries-with-a64
unifying-language-learning-paradigms51.4
star-bootstrapping-reasoning-with-reasoning68.8
star-bootstrapping-reasoning-with-reasoning36.6
palm-2-technical-report-190.4
unifiedqa-crossing-format-boundaries-with-a78.1
roberta-a-robustly-optimized-bert-pretraining72.1
human-parity-on-commonsenseqa-augmenting-self73.0
bloomberggpt-a-large-language-model-for64.2
unifying-language-learning-paradigms34.2
human-parity-on-commonsenseqa-augmenting-self91.2
deep-bidirectional-language-knowledge-graph78.2
align-mask-and-select-a-simple-method-for62.2
muppet-massive-multi-task-representations79.2
bloomberggpt-a-large-language-model-for60.4
bloomberggpt-a-large-language-model-for65.5
star-bootstrapping-reasoning-with-reasoning72.3
graph-based-reasoning-over-heterogeneous75.3
explain-yourself-leveraging-language-models64.7
unifiedqa-crossing-format-boundaries-with-a76.2
unifiedqa-crossing-format-boundaries-with-a62.5
commonsenseqa-a-question-answering-challenge55.9
grapeqa-graph-augmentation-and-pruning-to73.5
unicorn-on-rainbow-a-universal-commonsense79.3
star-bootstrapping-reasoning-with-reasoning60.0
albert-a-lite-bert-for-self-supervised76.5
star-bootstrapping-reasoning-with-reasoning 55.6
human-parity-on-commonsenseqa-augmenting-self89.4
star-bootstrapping-reasoning-with-reasoning20.9
unifying-language-learning-paradigms55.7
qa-gnn-reasoning-with-language-models-and76.1