HyperAI초신경

Zero-Shot Learning on MedConceptsQA

Evaluation Metric

Accuracy
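
As a minimal sketch of how an accuracy score like the ones reported on this page could be computed: the function name `accuracy_percent` and the sample answers are hypothetical, and the percentage scale is an assumption inferred from values such as 52.489. The benchmark's actual evaluation script is not shown here.

```python
from typing import Sequence


def accuracy_percent(predictions: Sequence[str], references: Sequence[str]) -> float:
    """Return the share of predictions that match the references, in percent.

    Hypothetical helper for illustration; not the benchmark's own scorer.
    """
    if len(predictions) != len(references):
        raise ValueError("predictions and references must have the same length")
    if not references:
        raise ValueError("at least one example is required")
    # Count exact matches between predicted and reference answers.
    correct = sum(p == r for p, r in zip(predictions, references))
    return 100.0 * correct / len(references)


# Made-up multiple-choice answers: three of four match.
score = accuracy_percent(["A", "B", "C", "D"], ["A", "B", "C", "A"])
print(score)  # 75.0
```

Exact string matching is the simplest choice. A real evaluation would likely normalize model output (for example, extract the chosen option letter) before comparing.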

Evaluation Results

Performance of each model on this benchmark

Comparison Table
Model Name                                      Accuracy
gpt-4-technical-report-1                        52.489
clinical-longformer-and-clinical-bigbird        25.040
biomedgpt-open-multimodal-generative-pre        24.747
gatortron-a-large-clinical-language-model-to    24.862
zephyr-direct-distillation-of-lm-alignment      25.538
small-language-models-learn-enhanced            25.680
Model 7                                         24.427
llama-open-and-efficient-foundation-language-1  25.840
biomistral-a-collection-of-open-source          24.569
meditron-70b-scaling-medical-pretraining-for    25.360
language-models-are-few-shot-learners           37.058
meditron-70b-scaling-medical-pretraining-for    25.751
biobert-a-pre-trained-biomedical-language       26.151