HyperAI

Common Sense Reasoning On Parus

المقاييس

Accuracy

النتائج

نتائج أداء النماذج المختلفة على هذا المعيار القياسي

اسم النموذج
Accuracy
Paper TitleRepository
RuBERT plain0.574--
majority_class0.498Unreasonable Effectiveness of Rule-Based Heuristics in Solving Russian SuperGLUE Tasks-
Baseline TF-IDF1.10.486RussianSuperGLUE: A Russian Language Understanding Evaluation Benchmark
Golden Transformer0.908--
ruRoberta-large finetune0.508--
YaLM 1.0B few-shot0.766--
Multilingual Bert0.528--
heuristic majority0.478Unreasonable Effectiveness of Rule-Based Heuristics in Solving Russian SuperGLUE Tasks-
RuGPT3Medium0.598--
RuBERT conversational0.508--
RuGPT3Large0.584--
MT5 Large0.504mT5: A massively multilingual pre-trained text-to-text transformer
Random weighted0.48Unreasonable Effectiveness of Rule-Based Heuristics in Solving Russian SuperGLUE Tasks-
RuGPT3Small0.562--
Human Benchmark0.982RussianSuperGLUE: A Russian Language Understanding Evaluation Benchmark
ruBert-large finetune0.492--
ruT5-large-finetune0.66--
SBERT_Large0.498--
SBERT_Large_mt_ru_finetuning0.498--
ruBert-base finetune0.476--
0 of 22 row(s) selected.