APPS
Metrics
llm_model
model_url
organization
parameters
release_date
strict accuracy-average
strict accuracy-competitive
strict accuracy-interview
strict accuracy-introductory
test case average-average
test case average-competitive
test case average-interview
test case average-introductory
updated_time
Results
Performance results of various models on this benchmark
Comparison table
Model name | llm_model | model_url | organization | parameters | release_date | strict accuracy-average | strict accuracy-competitive | strict accuracy-interview | strict accuracy-introductory | test case average-average | test case average-competitive | test case average-interview | test case average-introductory | updated_time |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Model 1 | GPT-2 0.1B | https://huggingface.co/transformers/v3.0.2/model_doc/gpt2.html | OpenAI | 0.1B | 2019.2.14 | 0.40 | 0.00 | 0.33 | 1.00 | 6.16 | 4.37 | 6.93 | 5.64 | 2021.11.8 |
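
For readers who want to use the table programmatically, here is a minimal sketch that parses one pipe-delimited row into a record keyed by column name. The helper names (`split_cells`, `parse_row`, `weighted_average`) are hypothetical and not part of any library. The split sizes used to recompute the `-average` columns are an assumption (1000 introductory, 3000 interview, 1000 competitive problems); the page itself does not say how the averages were computed.

```python
HEADER = (
    "Model name | llm_model | model_url | organization | parameters | release_date | "
    "strict accuracy-average | strict accuracy-competitive | strict accuracy-interview | "
    "strict accuracy-introductory | test case average-average | test case average-competitive | "
    "test case average-interview | test case average-introductory | updated_time |"
)
ROW = (
    "Model 1 | GPT-2 0.1B | https://huggingface.co/transformers/v3.0.2/model_doc/gpt2.html | "
    "OpenAI | 0.1B | 2019.2.14 | 0.40 | 0.00 | 0.33 | 1.00 | 6.16 | 4.37 | 6.93 | 5.64 | 2021.11.8 |"
)

# ASSUMPTION: number of test problems per difficulty level (not stated on the page).
SPLIT_SIZES = {"introductory": 1000, "interview": 3000, "competitive": 1000}


def split_cells(line):
    """Split a pipe-delimited table line into trimmed cell strings."""
    return [cell.strip() for cell in line.strip().strip("|").split("|")]


def parse_row(header, row):
    """Pair header names with row cells and convert metric columns to float."""
    keys = split_cells(header)
    values = split_cells(row)
    if len(keys) != len(values):
        raise ValueError(f"expected {len(keys)} cells, got {len(values)}")
    record = dict(zip(keys, values))
    for key in keys:
        if key.startswith(("strict accuracy", "test case average")):
            record[key] = float(record[key])
    return record


def weighted_average(record, metric):
    """Recompute the average of `metric` weighted by the assumed split sizes."""
    total = sum(SPLIT_SIZES.values())
    return sum(record[f"{metric}-{level}"] * n for level, n in SPLIT_SIZES.items()) / total


record = parse_row(HEADER, ROW)
print(record["llm_model"], record["organization"])
print(round(weighted_average(record, "strict accuracy"), 2))
print(round(weighted_average(record, "test case average"), 2))
```

Under the assumed split sizes, the recomputed values agree with the reported `strict accuracy-average` (0.40) and `test case average-average` (6.16), which suggests, but does not confirm, that the average columns are weighted by problem count rather than being a simple mean of the three levels.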