Logical Reasoning on RuWorldTree

Metrics

Accuracy

Results

Performance results of various models on this benchmark

Comparison Table
Model Name                                  Accuracy
tape-assessing-few-shot-russian-language    38.0
tape-assessing-few-shot-russian-language    83.7
tape-assessing-few-shot-russian-language    34.0
tape-assessing-few-shot-russian-language    40.7