HyperAI超神经
Dialogue Safety Prediction On Rt Inod
Evaluation metric
Best-of
Evaluation results
Performance of each model on this benchmark
| Model name | Best-of | Paper Title | Repository |
| --- | --- | --- | --- |
| Gemma | 0.91 | Benchmarking Llama2, Mistral, Gemma and GPT for Factuality, Toxicity, Bias and Propensity for Hallucinations | - |
| Mistral | 0.87 | Benchmarking Llama2, Mistral, Gemma and GPT for Factuality, Toxicity, Bias and Propensity for Hallucinations | - |
| GPT-4 | 0.91 | Benchmarking Llama2, Mistral, Gemma and GPT for Factuality, Toxicity, Bias and Propensity for Hallucinations | - |
| Llama2 | 0.86 | Benchmarking Llama2, Mistral, Gemma and GPT for Factuality, Toxicity, Bias and Propensity for Hallucinations | - |
| Baseline | 0.92 | Benchmarking Llama2, Mistral, Gemma and GPT for Factuality, Toxicity, Bias and Propensity for Hallucinations | - |