Question Answering on DROP

Metrics

Accuracy

Results

Performance of each model on this benchmark

| Model | Accuracy | Paper Title | Repository |
|---|---|---|---|
| PaLM 540B (Self Consistency) | 78.2 | Large Language Models Can Self-Improve | - |
| PaLM 540B (Self Improvement, Self Consistency) | 83 | Large Language Models Can Self-Improve | - |
| PaLM 540B (Self Improvement, Standard-Prompting) | 71.7 | Large Language Models Can Self-Improve | - |
| PaLM 540B (Standard-Prompting) | 60 | Large Language Models Can Self-Improve | - |
| PaLM 540B (CoT Prompting) | 70.6 | Large Language Models Can Self-Improve | - |
| PaLM 540B (Self Improvement, CoT Prompting) | 76.2 | Large Language Models Can Self-Improve | - |