HyperAI초신경

Olympiadbench

평가 지표

average
llm_model
maths_avg.
maths_en_comp
maths_zh_cee
maths_zh_comp
model_url
organization
parameters
physics_avg.
physics_en_comp
physics_zh_cee
release_date
updated_time

평가 결과

이 벤치마크에서 각 모델의 성능 결과

비교 표
모델 이름averagellm_modelmaths_avg.maths_en_compmaths_zh_ceemaths_zh_compmodel_urlorganizationparametersphysics_avg.physics_en_compphysics_zh_ceerelease_dateupdated_time
모델 13.65LLaVA-NeXT-34B4.33.984.642.6https://github.com/LLaVA-VL/LLaVA-NeXTGoogle34B2.081.362.322024.1.302024.6.6