HyperAI초신경

홈 뉴스 최신 연구 논문 튜토리얼 데이터셋 백과사전 SOTA LLM 모델 GPU 랭킹 컨퍼런스

한국어

HyperAI초신경

Stabletoolbench

평가 지표

average

i1 category

i1 instruction

i1 tool

i2 category

i2 instruction

i3 instruction

llm_model

model_url

organization

parameters

release_date

updated_time

평가 결과

이 벤치마크에서 각 모델의 성능 결과

모델 이름	average	i1 category	i1 instruction	i1 tool	i2 category	i2 instruction	i3 instruction	llm_model	model_url	organization	parameters	release_date	updated_time	Paper Title	Repository
API	46.6±1.3	47.3±0.6	52.2±1.1	53.6±1.3	42.5±2.1	35.8±2.0	48.1±0.8	GPT-3.5-Turbo-0613 (CoT)	https://community.openai.com/t/gpt-3-5-turbo-0613-function-calling-16k-context-window-and-lower-prices/263263	OpenAI	N/A	2023.6.13	2024.8.11	-	-

0 of 1 row(s) selected.