Boundary Detection on CoAuthor

Cohen’s Kappa score

Evaluation Results

Performance of each model on this benchmark

Model	Cohen's Kappa	Paper Title
GigaCheck (Mistral-7B-v0.3)	0.4158	GigaCheck: Detecting LLM-generated Content
DeBERTa-v3 (Naive)	0.4002	Detecting AI-Generated Sentences in Human-AI Collaborative Hybrid Texts: Challenges, Strategies, and Insights
GigaCheck (DN-DAB-DETR)	0.1885	GigaCheck: Detecting LLM-generated Content
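
For readers unfamiliar with the metric, Cohen's Kappa measures agreement between predicted and reference labels after correcting for the agreement expected by chance: kappa = (p_o - p_e) / (1 - p_e). The sketch below is only an illustration of that formula. It assumes per-sentence binary labels (0 = human-written, 1 = AI-generated); the label encoding, the function name `cohens_kappa`, and the sample data are hypothetical and are not taken from the benchmark's evaluation code.

```python
from collections import Counter


def cohens_kappa(reference, predicted):
    """Cohen's Kappa between two equal-length label sequences (illustrative helper)."""
    if len(reference) != len(predicted) or len(reference) == 0:
        raise ValueError("label sequences must be non-empty and of equal length")

    n = len(reference)

    # Observed agreement: fraction of positions where the labels match.
    observed = sum(r == p for r, p in zip(reference, predicted)) / n

    # Chance agreement: for each label, the product of its marginal
    # frequencies in the two sequences, summed over all labels.
    ref_counts = Counter(reference)
    pred_counts = Counter(predicted)
    expected = sum(ref_counts[label] * pred_counts[label] for label in ref_counts) / (n * n)

    if expected == 1.0:
        # Both sequences use a single identical label; kappa is undefined.
        return float("nan")

    return (observed - expected) / (1.0 - expected)


# Hypothetical sentence-level labels (0 = human, 1 = AI), made up for illustration.
reference = [0, 0, 0, 1, 1, 1, 1, 0]
predicted = [0, 0, 1, 1, 1, 1, 0, 0]
print(cohens_kappa(reference, predicted))  # 0.5
```

A score of 1 means perfect agreement, 0 means agreement no better than chance, and negative values mean agreement worse than chance.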
