HyperAI

Zero-Shot Learning on MedConceptsQA

Metrics

Accuracy

Results

Performance results of various models on this benchmark

Comparison table

| Model name | Accuracy |
| --- | --- |
| gpt-4-technical-report-1 | 52.489 |
| clinical-longformer-and-clinical-bigbird | 25.040 |
| biomedgpt-open-multimodal-generative-pre | 24.747 |
| gatortron-a-large-clinical-language-model-to | 24.862 |
| zephyr-direct-distillation-of-lm-alignment | 25.538 |
| small-language-models-learn-enhanced | 25.680 |
| Model 7 | 24.427 |
| llama-open-and-efficient-foundation-language-1 | 25.840 |
| biomistral-a-collection-of-open-source | 24.569 |
| meditron-70b-scaling-medical-pretraining-for | 25.360 |
| language-models-are-few-shot-learners | 37.058 |
| meditron-70b-scaling-medical-pretraining-for | 25.751 |
| biobert-a-pre-trained-biomedical-language | 26.151 |