Conversational Response Selection On Rrs
평가 지표
MAP
MRR
P@1
R10@1
R10@2
R10@5
평가 결과
이 벤치마크에서 각 모델의 성능 결과
비교 표
모델 이름 | MAP | MRR | P@1 | R10@1 | R10@2 | R10@5 |
---|---|---|---|---|---|---|
fine-grained-post-training-for-improving | 0.702 | 0.712 | 0.543 | 0.488 | 0.708 | 0.927 |
domain-adaptive-training-bert-for-response | 0.625 | 0.639 | 0.453 | 0.404 | 0.606 | 0.875 |
multi-turn-response-selection-for-chatbots | 0.511 | 0.534 | 0.347 | 0.308 | 0.457 | 0.751 |
sequential-matching-network-a-new | 0.487 | 0.501 | 0.309 | 0.281 | 0.442 | 0.723 |
speaker-aware-bert-for-multi-turn-response | 0.701 | 0.715 | 0.555 | 0.497 | 0.685 | 0.931 |
multi-hop-selector-network-for-multi-turn | 0.550 | 0.563 | 0.383 | 0.343 | 0.498 | 0.798 |
dialogue-response-selection-with-hierarchical | 0.671 | 0.683 | 0.503 | 0.454 | 0.659 | 0.917 |