HyperAI초신경

Multiple Choice Question Answering Mcqa On 25

평가 지표

Accuracy

평가 결과

이 벤치마크에서 각 모델의 성능 결과

비교 표
모델 이름Accuracy
towards-expert-level-medical-question92.3
towards-expert-level-medical-question95.2
biomedgpt-open-multimodal-generative-pre51.1
towards-expert-level-medical-question93.4
llama-2-open-foundation-and-fine-tuned-chat43.38
llama-2-open-foundation-and-fine-tuned-chat40.07