HyperAI

Multiple Choice Question Answering (MCQA) on 25

Metrics

Accuracy

Results

Performance results of various models on this benchmark

Comparison table
Model name                                     Accuracy
towards-expert-level-medical-question          92.3
towards-expert-level-medical-question          95.2
biomedgpt-open-multimodal-generative-pre       51.1
towards-expert-level-medical-question          93.4
llama-2-open-foundation-and-fine-tuned-chat    43.38
llama-2-open-foundation-and-fine-tuned-chat    40.07