HyperAI超神经

Gorilla

Evaluation Metrics

huggingface_err
huggingface_hallu
huggingface_overall
llm_model
model_url
organization
parameters
release_date
tensorflow hub_err
tensorflow hub_hallu
tensorflow hub_overall
torchhub_err
torchhub_hallu
torchhub_overall
updated_time

Evaluation Results

Performance of each model on this benchmark

Comparison Table
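| Model name | huggingface_err | huggingface_hallu | huggingface_overall | llm_model | model_url | organization | parameters | release_date | tensorflow hub_err | tensorflow hub_hallu | tensorflow hub_overall | torchhub_err | torchhub_hallu | torchhub_overall | updated_time |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Model 1 | 2.43 | 97.57 | 0.00 | LLAMA (0-shot) | https://github.com/meta-llama/llama | Meta | N/A | 2023.2.24 | 0.00 | 100.00 | 0.00 | 0.00 | 100.00 | 0.00 | 2023.5.24 |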
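
For the Model 1 row, the err, hallu, and overall values of each API hub add up to 100. This suggests that the three metrics are percentages that partition the test cases, but the page does not define them, so treat that reading as an assumption. The minimal Python sketch below checks the sum for the values in the row; the dictionary layout and the helper name `percentages_sum_to_100` are illustrative and not part of the benchmark.

```python
# Scores copied from the Model 1 row of the comparison table.
# The nested-dict layout is an illustrative choice, not a benchmark format.
row = {
    "huggingface": {"err": 2.43, "hallu": 97.57, "overall": 0.00},
    "tensorflow hub": {"err": 0.00, "hallu": 100.00, "overall": 0.00},
    "torchhub": {"err": 0.00, "hallu": 100.00, "overall": 0.00},
}


def percentages_sum_to_100(scores, tol=0.01):
    """Return True if err + hallu + overall is within tol of 100.

    Assumes (not stated on the page) that the three metrics are
    percentages that partition the test cases.
    """
    total = scores["err"] + scores["hallu"] + scores["overall"]
    return abs(total - 100.0) <= tol


# Check every hub in the row and report the outcome.
results = {hub: percentages_sum_to_100(scores) for hub, scores in row.items()}
for hub, ok in results.items():
    print(f"{hub}: {'consistent' if ok else 'inconsistent'}")
```

A check like this can flag rows that were garbled during extraction, since a misplaced decimal point in any of the three values would usually break the sum.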