HyperAI超神经

Question Answering On Copa

评估指标

Accuracy

评测结果

各个模型在此基准测试上的表现结果

模型名称
Accuracy
Paper TitleRepository
RoBERTa-Winogrande-ft 355M (fine-tuned)90.6WinoGrande: An Adversarial Winograd Schema Challenge at Scale
HASH Layers 10B (0-shot)64Efficient Language Modeling with Sparse all-MLP-
FLAN 137B (prompt-tuned)94Finetuned Language Models Are Zero-Shot Learners
RoBERTa-ft 355M (fine-tuned)86.4WinoGrande: An Adversarial Winograd Schema Challenge at Scale
FLAN 137B (zero-shot)91Finetuned Language Models Are Zero-Shot Learners
ST-MoE-L 4.1B (fine-tuned)91ST-MoE: Designing Stable and Transferable Sparse Expert Models-
T5-XL 3B (fine-tuned)92Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
GPT-3 175B (few-shot, k=32)92Language Models are Few-Shot Learners
Hybrid H3 125M (0-shot, rank classification)67Hungry Hungry Hippos: Towards Language Modeling with State Space Models
Vega v2 6B (KD-based prompt transfer)99.4Toward Efficient Language Model Pretraining and Downstream Adaptation via Self-Evolution: A Case Study on SuperGLUE-
H3 125M (0-shot, rank classification)51Hungry Hungry Hippos: Towards Language Modeling with State Space Models
T5-XXL 11B (fine-tuned)94.8Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
T0-3B (CoT fine-tuned)90.9The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning-
Causal Strength Computation (on ClueWeb12)69.9--
Neo-6B (few-shot)77.0Ask Me Anything: A simple strategy for prompting language models
KiC-770M85.30Knowledge-in-Context: Towards Knowledgeable Semi-Parametric Language Models-
Turing NLR v5 XXL 5.4B (fine-tuned)98.2Toward Efficient Language Model Pretraining and Downstream Adaptation via Self-Evolution: A Case Study on SuperGLUE-
ST-MoE-32B 269B (fine-tuned)99.2ST-MoE: Designing Stable and Transferable Sparse Expert Models-
BERT-large 340M80.8SocialIQA: Commonsense Reasoning about Social Interactions
PaLM 2-M (1-shot)90.0PaLM 2 Technical Report
0 of 60 row(s) selected.