HyperAI

Alignbench

Metrics

language_avg.
language_chi.
language_fund.
language_open.
language_pro.
language_role.
language_writ.
llm_model
model_url
organization
overall
parameters
reasoning_avg.
reasoning_logi.
reasoning_math.
release_date
updated_time

Results

Performance results of various models on this benchmark

Comparison table
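| Model name | language_avg. | language_chi. | language_fund. | language_open. | language_pro. | language_role. | language_writ. | llm_model | model_url | organization | overall | parameters | reasoning_avg. | reasoning_logi. | reasoning_math. | release_date | updated_time |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Model 1 | 8.29 | 7.33 | 7.99 | 8.61 | 8.65 | 8.47 | 8.67 | gpt-4-1106-preview | https://community.openai.com/t/gpt-4-1106-preview-vs-gpt-4/588424 | OpenAI | 8.01 | N/A | 7.73 | 7.66 | 7.8 | 2023.11.6 | 2024.8.25 |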
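
The averaged columns in the Model 1 row appear to be plain arithmetic means of their sub-scores. This is an inference from the numbers in the row, not something the page states. The sketch below checks that reading under this assumption: `language_avg.` as the mean of the six language sub-scores, `reasoning_avg.` as the mean of the two reasoning sub-scores, and `overall` as the mean of those two averages. The variable names are illustrative only.

```python
from statistics import mean

# Sub-scores copied from the Model 1 row (gpt-4-1106-preview).
language = {
    "chi": 7.33, "fund": 7.99, "open": 8.61,
    "pro": 8.65, "role": 8.47, "writ": 8.67,
}
reasoning = {"logi": 7.66, "math": 7.8}

# Assumption: each average is an unweighted mean, rounded to two decimals.
language_avg = round(mean(language.values()), 2)
reasoning_avg = round(mean(reasoning.values()), 2)

# Assumption: overall is the mean of the two category averages.
overall = round(mean([language_avg, reasoning_avg]), 2)

print(language_avg, reasoning_avg, overall)
```

Under these assumptions the computed values match the reported `language_avg.`, `reasoning_avg.`, and `overall` columns for this row. With only one row available, the match is consistent with unweighted averaging but does not confirm it for the benchmark as a whole.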