HyperAI

Hurtful Sentence Completion On Honest En

Evaluation Metrics

HONEST

Evaluation Results

Performance results of each model on this benchmark

Comparison Table
Model Name                                      HONEST
honest-measuring-hurtful-sentence-completion    3.33
honest-measuring-hurtful-sentence-completion    2.62
honest-measuring-hurtful-sentence-completion    2.38
honest-measuring-hurtful-sentence-completion    1.19
honest-measuring-hurtful-sentence-completion    1.90