HyperAI超神经

Common Sense Reasoning On Swag

评估指标

Test

评测结果

各个模型在此基准测试上的表现结果

比较表格
模型名称Test
roberta-a-robustly-optimized-bert-pretraining89.9
swag-a-large-scale-adversarial-dataset-for52.7
swag-a-large-scale-adversarial-dataset-for59.2
bert-pre-training-of-deep-bidirectional86.3
deberta-decoding-enhanced-bert-with90.8