Halueval

Metrics

dialogue

general

llm_model

model_url

organization

parameters

qa

release_date

summarization

updated_time

Results

Performance results of various models on this benchmark

											Paper Title	Code
API	72.40	79.44	ChatGPT	https://chatgpt.com/	OpenAI	N/A	62.59	2022.11.30	58.53	2023.10.23	-

0 of 1 row(s) selected.

Halueval

Metrics

dialogue

general

llm_model

model_url

organization

parameters

qa

release_date

summarization

updated_time

Results

Performance results of various models on this benchmark

											Paper Title	Code
API	72.40	79.44	ChatGPT	https://chatgpt.com/	OpenAI	N/A	62.59	2022.11.30	58.53	2023.10.23	-

0 of 1 row(s) selected.

Halueval | SOTA | HyperAI