Explanation Generation On E Snli Ve
评估指标
Human Explanation Rating
评测结果
各个模型在此基准测试上的表现结果
比较表格
模型名称 | Human Explanation Rating |
---|---|
harnessing-the-power-of-multi-task | 85.7 |
harnessing-the-power-of-multi-task | 80.4 |
各个模型在此基准测试上的表现结果
模型名称 | Human Explanation Rating |
---|---|
harnessing-the-power-of-multi-task | 85.7 |
harnessing-the-power-of-multi-task | 80.4 |