HyperAI

Bias Detection On Stereoset 1

Metrics

ICAT Score
LMS
SS

Results

Performance results of various models on this benchmark

Comparison Table
Model NameICAT ScoreLMSSS
galactica-a-large-language-model-for-science-16074.859.9
stereoset-measuring-stereotypical-bias-in71.73--
stereoset-measuring-stereotypical-bias-in69.89--
stereoset-measuring-stereotypical-bias-in70.54--
galactica-a-large-language-model-for-science-165.67556.2
stereoset-measuring-stereotypical-bias-in72.03--
stereoset-measuring-stereotypical-bias-in62.10--
stereoset-measuring-stereotypical-bias-in71.21--
stereoset-measuring-stereotypical-bias-in67.50--
galactica-a-large-language-model-for-science-160.877.660.8
stereoset-measuring-stereotypical-bias-in72.97--