HyperAI

Logical Reasoning On Ruworldtree

Metriken

Accuracy

Ergebnisse

Leistungsergebnisse verschiedener Modelle zu diesem Benchmark

Vergleichstabelle
ModellnameAccuracy
tape-assessing-few-shot-russian-language38.0
tape-assessing-few-shot-russian-language83.7
tape-assessing-few-shot-russian-language34.0
tape-assessing-few-shot-russian-language40.7