Common Sense Reasoning on SWAG
Metrics
Test accuracy (%)
Results
Reported test-set results for models evaluated on the SWAG benchmark
Comparison Table
Model Name | Test accuracy (%) |
---|---|
roberta-a-robustly-optimized-bert-pretraining | 89.9 |
swag-a-large-scale-adversarial-dataset-for | 52.7 |
swag-a-large-scale-adversarial-dataset-for | 59.2 |
bert-pre-training-of-deep-bidirectional | 86.3 |
deberta-decoding-enhanced-bert-with | 90.8 |