HyperAI

StudentEval

Metrics

first failure
first success
humaneval
last failure
last success
llm_model
model_url
organization
parameters
release_date
updated_time

Results

Performance results of various models on this benchmark

Comparison Table
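| Model Name | first failure | first success | humaneval | last failure | last success | llm_model | model_url | organization | parameters | release_date | updated_time |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Model 1 | 11.76 | 44.84 | 48.10 | 13.90 | 47.40 | GPT-3.5-Turbo-0301 | https://platform.openai.com/docs/models | OpenAI | N/A | 2023.3.1 | 2024.8.11 |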