Boundary Detection On Coauthor
Evaluation Metric
Cohen’s Kappa score
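For readers unfamiliar with the metric: Cohen's kappa measures agreement between two label sequences after correcting for the agreement expected by chance, kappa = (p_o - p_e) / (1 - p_e), where p_o is the observed agreement rate and p_e is the chance agreement rate derived from each sequence's label frequencies. The following is a minimal sketch of that standard formula. It is not the benchmark's own evaluation script. The function name `cohen_kappa`, the per-unit 0/1 labels (for example, human-written vs. machine-written), and the sample data are illustrative assumptions.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two equal-length label sequences (hypothetical helper)."""
    if len(labels_a) != len(labels_b):
        raise ValueError("label sequences must have the same length")
    n = len(labels_a)
    if n == 0:
        raise ValueError("label sequences must not be empty")

    # Observed agreement: fraction of positions where both sequences agree.
    p_o = sum(a == b for a, b in zip(labels_a, labels_b)) / n

    # Chance agreement: for each label, the product of its marginal
    # frequencies in the two sequences, summed over all labels.
    count_a = Counter(labels_a)
    count_b = Counter(labels_b)
    p_e = sum(count_a[c] * count_b[c] for c in count_a) / (n * n)

    if p_e == 1.0:
        # Both sequences use one identical label everywhere; kappa is
        # formally undefined here. Returning 1.0 is a convention chosen
        # for this sketch, not part of the metric's definition.
        return 1.0
    return (p_o - p_e) / (1 - p_e)


# Made-up example labels (assumption): 0 = human-written, 1 = machine-written.
gold = [0, 0, 1, 1, 1, 0]
pred = [0, 0, 1, 1, 0, 0]
print(round(cohen_kappa(gold, pred), 4))
```

A score of 1 indicates perfect agreement and 0 indicates agreement no better than chance, so higher values in the table below are better.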
Evaluation Results
Performance of each model on this benchmark
Model Name | Cohen’s Kappa score | Paper Title | Repository |
---|---|---|---|
GigaCheck (Mistral-7B-v0.3) | 0.4158 | GigaCheck: Detecting LLM-generated Content | - |
GigaCheck (DN-DAB-DETR) | 0.1885 | GigaCheck: Detecting LLM-generated Content | - |
DeBERTa-v3 (Naive) | 0.4002 | Detecting AI-Generated Sentences in Human-AI Collaborative Hybrid Texts: Challenges, Strategies, and Insights | - |