Question Answering On Casehold
评估指标
Macro F1 (10-fold)
评测结果
各个模型在此基准测试上的表现结果
比较表格
模型名称 | Macro F1 (10-fold) |
---|---|
when-does-pretraining-help-assessing-self | 61.3 |
when-does-pretraining-help-assessing-self | 68.0 |
when-does-pretraining-help-assessing-self | 69.5 |
各个模型在此基准测试上的表现结果
模型名称 | Macro F1 (10-fold) |
---|---|
when-does-pretraining-help-assessing-self | 61.3 |
when-does-pretraining-help-assessing-self | 68.0 |
when-does-pretraining-help-assessing-self | 69.5 |