HyperAI

Logical Reasoning On Winograd Automatic

Metrics

Accuracy

Results

Performance results of various models on this benchmark

Comparison Table
Model NameAccuracy
tape-assessing-few-shot-russian-language57.9
tape-assessing-few-shot-russian-language57.2
tape-assessing-few-shot-russian-language87.0
tape-assessing-few-shot-russian-language55.5