Question Similarity On Q2Q Arabic Benchmark
评估指标
F1 score
评测结果
各个模型在此基准测试上的表现结果
模型名称 | F1 score | Paper Title | Repository |
---|---|---|---|
mBert | 0.8365 | Deep Learning Models for Multilingual Hate Speech Detection | |
Ensemble multilingual BERT model | 0.95924 | The Inception Team at NSURL-2019 Task 8: Semantic Question Similarity in Arabic | - |
Tha3aroon | 0.94848 | Tha3aroon at NSURL-2019 Task 8: Semantic Question Similarity in Arabic |
0 of 3 row(s) selected.