HyperAI초신경

Bias Detection On Stereoset 1

Evaluation Metrics

ICAT Score
LMS
SS
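
The three metrics are related by a simple formula. As defined in the StereoSet paper, LMS (Language Modeling Score) is the percentage of cases where the model prefers a meaningful continuation over a meaningless one, SS (Stereotype Score) is the percentage where it prefers the stereotypical continuation over the anti-stereotypical one (50 is ideal), and the ICAT Score combines the two as LMS * min(SS, 100 - SS) / 50. The following is a minimal sketch of that arithmetic. The function name `icat_score` and the input range check are illustrative assumptions, not part of any official evaluation script.

```python
def icat_score(lms: float, ss: float) -> float:
    """Combine LMS and SS into an ICAT score (all values on a 0-100 scale).

    Hypothetical helper for illustration: the formula follows the StereoSet
    definition, but the name and validation are assumptions.
    """
    if not (0.0 <= lms <= 100.0 and 0.0 <= ss <= 100.0):
        raise ValueError("lms and ss must both lie in [0, 100]")
    # min(ss, 100 - ss) peaks at 50 when ss == 50 (no preference either way),
    # so dividing by 50 rescales the bias term to the range [0, 1].
    return lms * min(ss, 100.0 - ss) / 50.0


if __name__ == "__main__":
    # Example inputs: LMS 74.8 and SS 59.9.
    print(round(icat_score(74.8, 59.9), 1))
```

An ideal model (LMS 100, SS 50) scores 100, while a fully biased model (SS 0 or 100) scores 0 regardless of its LMS. Published ICAT values may differ slightly from this calculation when LMS and SS were rounded before being reported.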

Evaluation Results

Performance results of each model on this benchmark

Comparison Table
| Model Name | ICAT Score | LMS | SS |
| --- | --- | --- | --- |
| galactica-a-large-language-model-for-science-1 | 60 | 74.8 | 59.9 |
| stereoset-measuring-stereotypical-bias-in | 71.73 | - | - |
| stereoset-measuring-stereotypical-bias-in | 69.89 | - | - |
| stereoset-measuring-stereotypical-bias-in | 70.54 | - | - |
| galactica-a-large-language-model-for-science-1 | 65.6 | 75 | 56.2 |
| stereoset-measuring-stereotypical-bias-in | 72.03 | - | - |
| stereoset-measuring-stereotypical-bias-in | 62.10 | - | - |
| stereoset-measuring-stereotypical-bias-in | 71.21 | - | - |
| stereoset-measuring-stereotypical-bias-in | 67.50 | - | - |
| galactica-a-large-language-model-for-science-1 | 60.8 | 77.6 | 60.8 |
| stereoset-measuring-stereotypical-bias-in | 72.97 | - | - |