HyperAI超神经

Answerability Prediction On Peerqa

评估指标

Macro F1

评测结果

各个模型在此基准测试上的表现结果

模型名称
Macro F1
Paper TitleRepository
Command-R-v01-34B-128k0.4197--
GPT-3.5-Turbo-0613-16k0.3304Language Models are Few-Shot Learners
Mistral-IT-v02-7B-32k0.4703Mistral 7B
Llama-3-IT-8B-32k0.2881The Llama 3 Herd of Models
GPT-4o-2024-08-060.3087GPT-4 Technical Report
Llama-3-IT-8B-8k0.3112The Llama 3 Herd of Models
0 of 6 row(s) selected.