Dialogue Safety Prediction On Rt Inod

Metrics

Best-of

Results

Performance results of various models on this benchmark

Comparison Table
Model Name                                       Best-of
benchmarking-llama2-mistral-gemma-and-gpt-for    0.91
benchmarking-llama2-mistral-gemma-and-gpt-for    0.87
benchmarking-llama2-mistral-gemma-and-gpt-for    0.91
benchmarking-llama2-mistral-gemma-and-gpt-for    0.86
benchmarking-llama2-mistral-gemma-and-gpt-for    0.92