Command Palette

Search for a command to run...

Natural Language Inference On Rcb

평가 지표

Accuracy
Average F1

평가 결과

이 벤치마크에서 각 모델의 성능 결과

Paper Title
Human Benchmark0.7020.68RussianSuperGLUE: A Russian Language Understanding Evaluation Benchmark
Golden Transformer0.5460.406-
ruRoberta-large finetune0.5180.357-
ruBert-base finetune0.5090.333-
ruBert-large finetune0.50.356-
ruT5-large-finetune0.4980.306-
SBERT_Large_mt_ru_finetuning0.4860.351-
RuGPT3Large 0.4840.417-
RuBERT conversational0.4840.452-
majority_class0.4840.217Unreasonable Effectiveness of Rule-Based Heuristics in Solving Russian SuperGLUE Tasks
RuGPT3Small0.4730.356-
ruT5-base-finetune0.4680.307-
RuBERT plain0.4630.367-
RuGPT3Medium0.4610.372-
MT5 Large0.4540.366mT5: A massively multilingual pre-trained text-to-text transformer
SBERT_Large0.4520.371-
YaLM 1.0B few-shot0.4470.408-
Multilingual Bert0.4450.367-
Baseline TF-IDF1.10.4410.301RussianSuperGLUE: A Russian Language Understanding Evaluation Benchmark
heuristic majority0.4380.4Unreasonable Effectiveness of Rule-Based Heuristics in Solving Russian SuperGLUE Tasks
0 of 22 row(s) selected.
Natural Language Inference On Rcb | SOTA | HyperAI초신경