Salad Bench
评估指标
attack-enhanced-elo
attack-enhanced-safe%
base set-elo
base set-safe%
llm_model
model_url
organization
parameters
release_date
updated_time
评测结果
各个模型在此基准测试上的表现结果
模型名称 | attack-enhanced-elo | attack-enhanced-safe% | base set-elo | base set-safe% | llm_model | model_url | organization | parameters | release_date | updated_time | Paper Title | Repository |
---|---|---|---|---|---|---|---|---|---|---|---|---|
API | 954 | 12.48 | 1016 | 90.45 | ChatGLM3-6B | https://github.com/THUDM/ChatGLM3 | THUDN | 6B | 2023.10.27 | 2024.8.11 | - | - |
0 of 1 row(s) selected.