HyperAI

Common Sense Reasoning On Swag

Metrics

Test

Results

Performance results of various models on this benchmark