HyperAI초신경
홈
뉴스
최신 연구 논문
튜토리얼
데이터셋
백과사전
SOTA
LLM 모델
GPU 랭킹
컨퍼런스
전체 검색
소개
한국어
HyperAI초신경
Toggle sidebar
전체 사이트 검색...
⌘
K
홈
SOTA
Dialogue Safety Prediction
Dialogue Safety Prediction On Rt Inod
Dialogue Safety Prediction On Rt Inod
평가 지표
Best-of
평가 결과
이 벤치마크에서 각 모델의 성능 결과
Columns
모델 이름
Best-of
Paper Title
Repository
Gemma
0.91
Benchmarking Llama2, Mistral, Gemma and GPT for Factuality, Toxicity, Bias and Propensity for Hallucinations
-
Mistral
0.87
Benchmarking Llama2, Mistral, Gemma and GPT for Factuality, Toxicity, Bias and Propensity for Hallucinations
-
GPT-4
0.91
Benchmarking Llama2, Mistral, Gemma and GPT for Factuality, Toxicity, Bias and Propensity for Hallucinations
-
Llama2
0.86
Benchmarking Llama2, Mistral, Gemma and GPT for Factuality, Toxicity, Bias and Propensity for Hallucinations
-
Baseline
0.92
Benchmarking Llama2, Mistral, Gemma and GPT for Factuality, Toxicity, Bias and Propensity for Hallucinations
-
0 of 5 row(s) selected.
Previous
Next