Stabletoolbench

average

i1 category

i1 instruction

i1 tool

i2 category

i2 instruction

i3 instruction

llm_model

model_url

organization

parameters

release_date

updated_time

Ergebnisse

Leistungsergebnisse verschiedener Modelle bei diesem Benchmark

														Paper Title	Code
API	46.6±1.3	47.3±0.6	52.2±1.1	53.6±1.3	42.5±2.1	35.8±2.0	48.1±0.8	GPT-3.5-Turbo-0613 (CoT)	https://community.openai.com/t/gpt-3-5-turbo-0613-function-calling-16k-context-window-and-lower-prices/263263	OpenAI	N/A	2023.6.13	2024.8.11	-

0 of 1 row(s) selected.