Bias Detection On Stereoset 1
Métriques
ICAT Score
LMS
SS
Résultats
Résultats de performance de divers modèles sur ce benchmark
Tableau comparatif
Nom du modèle | ICAT Score | LMS | SS |
---|---|---|---|
galactica-a-large-language-model-for-science-1 | 60 | 74.8 | 59.9 |
stereoset-measuring-stereotypical-bias-in | 71.73 | - | - |
stereoset-measuring-stereotypical-bias-in | 69.89 | - | - |
stereoset-measuring-stereotypical-bias-in | 70.54 | - | - |
galactica-a-large-language-model-for-science-1 | 65.6 | 75 | 56.2 |
stereoset-measuring-stereotypical-bias-in | 72.03 | - | - |
stereoset-measuring-stereotypical-bias-in | 62.10 | - | - |
stereoset-measuring-stereotypical-bias-in | 71.21 | - | - |
stereoset-measuring-stereotypical-bias-in | 67.50 | - | - |
galactica-a-large-language-model-for-science-1 | 60.8 | 77.6 | 60.8 |
stereoset-measuring-stereotypical-bias-in | 72.97 | - | - |