Common Sense Reasoning On Swag
评估指标
Test
评测结果
各个模型在此基准测试上的表现结果
比较表格
模型名称 | Test |
---|---|
roberta-a-robustly-optimized-bert-pretraining | 89.9 |
swag-a-large-scale-adversarial-dataset-for | 52.7 |
swag-a-large-scale-adversarial-dataset-for | 59.2 |
bert-pre-training-of-deep-bidirectional | 86.3 |
deberta-decoding-enhanced-bert-with | 90.8 |