Common Sense Reasoning
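The Common Sense Reasoning task aims to move models beyond pattern recognition so that they reason with common sense or world knowledge. The goal is for models to understand complex situations and make reasonable judgments and predictions, improving their intelligence and effectiveness in areas such as natural language processing and dialogue systems.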
ARC (Challenge)
ARC (Easy)
GAL 120B (0-shot)
BIG-bench (Causal Judgment)
BIG-bench (Date Understanding)
BIG-bench (Disambiguation QA)
BIG-bench (Known Unknowns)
PaLM-540B (few-shot, k=5)
BIG-bench (Logical Sequence)
Chinchilla-70B (few-shot, k=5)
BIG-bench (Sports Understanding)
BIG-bench (Winowhy)
CODAH
BERT Large
CommonsenseQA
QA-GNN
CrowdSource QA
Event2Mind dev
Event2Mind test
EA-VQ-VAE
PARus
ReCoRD
ST-MoE-32B 269B (fine-tuned)
RuCoS
Russian Event2Mind
araneum word2vec (skipgram) + GRU
RWSD
SWAG
DeBERTa-large
Visual Dialog v0.9
NMN [kottur2018visual]
WinoGAViL
ViLT
WinoGrande
PaLM 540B (0-shot)