HyperAI

Logical Reasoning On Winograd Automatic

Metriken

Accuracy

Ergebnisse

Leistungsergebnisse verschiedener Modelle zu diesem Benchmark

Vergleichstabelle
ModellnameAccuracy
tape-assessing-few-shot-russian-language57.9
tape-assessing-few-shot-russian-language57.2
tape-assessing-few-shot-russian-language87.0
tape-assessing-few-shot-russian-language55.5