Boundary Detection On Coauthor
评估指标
Cohen’s Kappa score
评测结果
各个模型在此基准测试上的表现结果
模型名称 | Cohen’s Kappa score | Paper Title | Repository |
---|---|---|---|
GigaCheck (Mistral-7B-v0.3) | 0.4158 | GigaCheck: Detecting LLM-generated Content | - |
GigaCheck (DN-DAB-DETR) | 0.1885 | GigaCheck: Detecting LLM-generated Content | - |
DeBERTa-v3 (Naive) | 0.4002 | Detecting AI-Generated Sentences in Human-AI Collaborative Hybrid Texts: Challenges, Strategies, and Insights |
0 of 3 row(s) selected.