HyperAI

Stabletoolbench

Metriken

average
i1 category
i1 instruction
i1 tool
i2 category
i2 instruction
i3 instruction
llm_model
model_url
organization
parameters
release_date
updated_time

Ergebnisse

Leistungsergebnisse verschiedener Modelle zu diesem Benchmark

Modellname
average
i1 category
i1 instruction
i1 tool
i2 category
i2 instruction
i3 instruction
llm_model
model_url
organization
parameters
release_date
updated_time
Paper TitleRepository
API46.6±1.347.3±0.652.2±1.153.6±1.342.5±2.135.8±2.048.1±0.8GPT-3.5-Turbo-0613 (CoT)https://community.openai.com/t/gpt-3-5-turbo-0613-function-calling-16k-context-window-and-lower-prices/263263OpenAIN/A2023.6.132024.8.11--
0 of 1 row(s) selected.