Logical Reasoning On Lingoly
المقاييس
Delta_NoContext
Exact Match Accuracy
النتائج
نتائج أداء النماذج المختلفة على هذا المعيار القياسي
جدول المقارنة
اسم النموذج | Delta_NoContext | Exact Match Accuracy |
---|---|---|
lingoly-a-benchmark-of-olympiad-level | 23.4% | 32.1% |
lingoly-a-benchmark-of-olympiad-level | 21.5% | 33.4% |
lingoly-a-benchmark-of-olympiad-level | 11.2% | 21.2% |
lingoly-a-benchmark-of-olympiad-level | 28.8% | 46.3% |
lingoly-a-benchmark-of-olympiad-level | 11.6% | 21.5% |
lingoly-a-benchmark-of-olympiad-level | 4.9% | 11.4% |
lingoly-a-benchmark-of-olympiad-level | 2.9% | 10.3% |
lingoly-a-benchmark-of-olympiad-level | 1.1% | 6.4% |
lingoly-a-benchmark-of-olympiad-level | 25.1% | 37.6% |
lingoly-a-benchmark-of-olympiad-level | 6.4% | 14.2% |
lingoly-a-benchmark-of-olympiad-level | 2.2% | 4.9% |