Dialogue Safety Prediction On Rt Inod
Metrics
Best-of
Results
Performance results of various models on this benchmark
| Model Name | Best-of | Paper Title | Repository |
| --- | --- | --- | --- |
| Gemma | 0.91 | Benchmarking Llama2, Mistral, Gemma and GPT for Factuality, Toxicity, Bias and Propensity for Hallucinations | - |
| Mistral | 0.87 | Benchmarking Llama2, Mistral, Gemma and GPT for Factuality, Toxicity, Bias and Propensity for Hallucinations | - |
| GPT-4 | 0.91 | Benchmarking Llama2, Mistral, Gemma and GPT for Factuality, Toxicity, Bias and Propensity for Hallucinations | - |
| Llama2 | 0.86 | Benchmarking Llama2, Mistral, Gemma and GPT for Factuality, Toxicity, Bias and Propensity for Hallucinations | - |
| Baseline | 0.92 | Benchmarking Llama2, Mistral, Gemma and GPT for Factuality, Toxicity, Bias and Propensity for Hallucinations | - |