HyperAI

Bias Detection On Stereoset 1

Metriken

ICAT Score
LMS
SS

Ergebnisse

Leistungsergebnisse verschiedener Modelle zu diesem Benchmark

Vergleichstabelle
ModellnameICAT ScoreLMSSS
galactica-a-large-language-model-for-science-16074.859.9
stereoset-measuring-stereotypical-bias-in71.73--
stereoset-measuring-stereotypical-bias-in69.89--
stereoset-measuring-stereotypical-bias-in70.54--
galactica-a-large-language-model-for-science-165.67556.2
stereoset-measuring-stereotypical-bias-in72.03--
stereoset-measuring-stereotypical-bias-in62.10--
stereoset-measuring-stereotypical-bias-in71.21--
stereoset-measuring-stereotypical-bias-in67.50--
galactica-a-large-language-model-for-science-160.877.660.8
stereoset-measuring-stereotypical-bias-in72.97--