Realtoxicityprompts
평가 지표
exp. max. toxicity_non-toxic
exp. max. toxicity_toxic
llm_model
model_url
organization
parameters
release_date
toxicity prob._non-toxic
toxicity prob._toxic
updated_time
평가 결과
이 벤치마크에서 각 모델의 성능 결과
비교 표
모델 이름 | exp. max. toxicity_non-toxic | exp. max. toxicity_toxic | llm_model | model_url | organization | parameters | release_date | toxicity prob._non-toxic | toxicity prob._toxic | updated_time |
---|---|---|---|---|---|---|---|---|---|---|
모델 1 | 0.58 | 0.78 | GPT-1 | https://medium.com/@2468086464/gpt-1-gpt-2-and-gpt3-what-are-the-differences-bc7d8284a0b8 | OpenAI | N/A | 2018.6.30 | 0.60 | 0.90 | 2020.11.16 |