Multiple Choice Qa
自然言語処理(NLP)は、人工知能の一分野で、コンピューターが人間の言葉を理解し、解釈し、生成することを目指しています。その目的は、人間と機械のコミュニケーションのギャップを埋め、情報のやり取りの効率と質を向上させることです。NLPの応用価値は広範で、スマートカスタマーサービス、感情分析、機械翻訳、要約作成などがあり、これらは社会の情報化や企業の智能化に大きく貢献しています。
BIG-bench (Hyperbaton)
BIG-bench (Movie Recommendation)
BIG-bench (Navigate)
BIG-bench (Novel Concepts)
BIG-bench (Ruin Names)
FrenchMedMCQA
CamemBERT
IndicGLUE WSTP Pa
MedMCQA
Meditron-70B (CoT + SC)
MMLU (Abstract Algebra)
GAL 30B (zero-shot)
MMLU (Anatomy)
Med-PaLM 2 (ER)
MMLU (Astronomy)
Chinchilla (few-shot, k=5)
MMLU (Clinical Knowledge)
MMLU (College Biology)
Chinchilla (few-shot, k=5)
MMLU (College Chemistry)
Chinchilla (few-shot, k=5)
MMLU (College Computer Science)
Chinchilla (few-shot, k=5)
MMLU (College Mathematics)
GAL 120B (zero-shot)
MMLU (College Medicine)
MMLU (College Physics)
MMLU (Econometrics)
Gopher (few-shot, k=5)
MMLU (Electrical Engineer)
GAL 120B (zero-shot)
MMLU (Elementary Mathematics)
Chinchilla (few-shot, k=5)
MMLU (Formal Logic)
Gopher (few-shot, k=5)
MMLU (High School Biology)
Chinchilla (few-shot, k=5)
MMLU (High School Chemistry)
Chinchilla (few-shot, k=5)
MMLU (High School Computer Science)
GAL 120B (zero-shot)
MMLU (High School Mathematics)
GAL 120B (zero-shot)
MMLU (High School Physics)
MMLU (High School Statistics)
MMLU (Machine Learning)
Chinchilla (few-shot, k=5)
MMLU (Medical Genetics)
MMLU (Professional medicine)