Question Similarity On Q2Q Arabic Benchmark
Evaluation Metric
F1 score
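For reference, a minimal sketch of how an F1 score can be computed for a binary question-pair task. It assumes labels of 1 (similar) and 0 (not similar) and scores the positive class only; the benchmark's exact averaging scheme is not stated here, and the function name `f1_score` and the sample labels are illustrative, not taken from any of the listed systems.

```python
def f1_score(y_true, y_pred, positive=1):
    """Binary F1: harmonic mean of precision and recall for the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        # No true positives: precision or recall is zero (or undefined),
        # so F1 is reported as 0.0 by convention.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical gold labels and predictions for six question pairs.
gold = [1, 0, 1, 1, 0, 1]
pred = [1, 0, 0, 1, 1, 1]
score = f1_score(gold, pred)
print(f"F1 = {score:.4f}")
```

Because F1 combines precision and recall, a system cannot score well simply by predicting every pair as similar or as dissimilar.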
Evaluation Results
Performance results of each model on this benchmark
Model Name | F1 score | Paper Title | Repository |
---|---|---|---|
mBert | 0.8365 | Deep Learning Models for Multilingual Hate Speech Detection | - |
Ensemble multilingual BERT model | 0.95924 | The Inception Team at NSURL-2019 Task 8: Semantic Question Similarity in Arabic | - |
Tha3aroon | 0.94848 | Tha3aroon at NSURL-2019 Task 8: Semantic Question Similarity in Arabic | - |