HyperAI초신경

Hellobench

평가 지표

average
chat-rescaled score
heuristic text generation-rescaled score
llm_model
model_url
open-ended qa-rescaled score
organization
parameters
release_date
summarization-rescaled score
text completion-rescaled score
updated_time

평가 결과

이 벤치마크에서 각 모델의 성능 결과

비교 표
모델 이름averagechat-rescaled scoreheuristic text generation-rescaled scorellm_modelmodel_urlopen-ended qa-rescaled scoreorganizationparametersrelease_datesummarization-rescaled scoretext completion-rescaled scoreupdated_time
모델 148.5542.8847.87GPT-4o-2024-08-06https://platform.openai.com/docs/guides54.82OpenAIN/A2024/8/629.7167.492024/9/24