Bias Detection on StereoSet
Metrics
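ICAT Score (Idealized Context Association Test score; higher is better)
LMS (Language Modeling Score; higher is better)
SS (Stereotype Score; 50 is ideal)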
Results
Performance results of various models on this benchmark
Comparison Table
Model (source paper) | ICAT Score | LMS | SS |
---|---|---|---|
galactica-a-large-language-model-for-science-1 | 60 | 74.8 | 59.9 |
stereoset-measuring-stereotypical-bias-in | 71.73 | - | - |
stereoset-measuring-stereotypical-bias-in | 69.89 | - | - |
stereoset-measuring-stereotypical-bias-in | 70.54 | - | - |
galactica-a-large-language-model-for-science-1 | 65.6 | 75 | 56.2 |
stereoset-measuring-stereotypical-bias-in | 72.03 | - | - |
stereoset-measuring-stereotypical-bias-in | 62.10 | - | - |
stereoset-measuring-stereotypical-bias-in | 71.21 | - | - |
stereoset-measuring-stereotypical-bias-in | 67.50 | - | - |
galactica-a-large-language-model-for-science-1 | 60.8 | 77.6 | 60.8 |
stereoset-measuring-stereotypical-bias-in | 72.97 | - | - |
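
The three metrics are related: in the StereoSet definition, the ICAT score combines LMS and SS as ICAT = LMS * min(SS, 100 - SS) / 50, so a model reaches 100 only with a perfect LMS and an SS of exactly 50. The sketch below is a minimal illustration of that relation, not code from the benchmark itself. The function name `icat_score` and the `rows` list are hypothetical, and the inputs are assumed to be percentages in the range 0 to 100. It recomputes ICAT for the three rows above that report all three metrics. The recomputed values agree with the table to within about 0.1, a gap that is consistent with the table's LMS and SS being rounded.

```python
def icat_score(lms: float, ss: float) -> float:
    """Combine LMS and SS (both percentages, 0-100) into an ICAT score.

    Uses the StereoSet relation ICAT = LMS * min(SS, 100 - SS) / 50.
    """
    if not (0.0 <= lms <= 100.0 and 0.0 <= ss <= 100.0):
        raise ValueError("lms and ss must be percentages between 0 and 100")
    return lms * min(ss, 100.0 - ss) / 50.0


# (LMS, SS, reported ICAT) copied from the table rows that list all three metrics.
rows = [
    (74.8, 59.9, 60.0),
    (75.0, 56.2, 65.6),
    (77.6, 60.8, 60.8),
]

for lms, ss, reported in rows:
    computed = icat_score(lms, ss)
    print(f"LMS={lms} SS={ss} reported={reported} computed={computed:.2f}")
```

Rows that show "-" for LMS and SS cannot be checked this way, since the ICAT score alone does not determine the other two metrics.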