HyperAI초신경

Cflue

Evaluation Metrics

llm_model
model_url
organization
parameters
prediction_acc (%)
prediction_f1 (%)
reasoning_bleu-1
reasoning_bleu-4
reasoning_rouge-1
reasoning_rouge-2
reasoning_rouge-l
release_date
updated_time

Evaluation Results

Performance results of each model on this benchmark

Comparison Table
| Model name | llm_model | model_url | organization | parameters | prediction_acc (%) | prediction_f1 (%) | reasoning_bleu-1 | reasoning_bleu-4 | reasoning_rouge-1 | reasoning_rouge-2 | reasoning_rouge-l | release_date | updated_time |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Model 1 | GPT-4-turbo | https://help.openai.com/en/articles/8555510-gpt-4-turbo-in-the-openai-api | OpenAI | N/A | 60.61±0.21 | 60.31±0.19 | 30.66±0.22 | 10.61±0.13 | 40.28±0.20 | 17.23±0.15 | 28.62±0.19 | 2024.5.26 | 2024.8.11 |