HyperAI초신경

Alignbench

평가 지표

language_avg.
language_chi.
language_fund.
language_open.
language_pro.
language_role.
language_writ.
llm_model
model_url
organization
overall
parameters
reasoning_avg.
reasoning_logi.
reasoning_math.
release_date
updated_time

평가 결과

이 벤치마크에서 각 모델의 성능 결과

비교 표
모델 이름language_avg.language_chi.language_fund.language_open.language_pro.language_role.language_writ.llm_modelmodel_urlorganizationoverallparametersreasoning_avg.reasoning_logi.reasoning_math.release_dateupdated_time
모델 18.297.337.998.618.658.478.67gpt-4-1106-previewhttps://community.openai.com/t/gpt-4-1106-preview-vs-gpt-4/588424OpenAI8.01N/A7.737.667.82023.11.62024.8.25