HyperAI초신경

홈 뉴스 최신 연구 논문 튜토리얼 데이터셋 백과사전 SOTA LLM 모델 GPU 랭킹 컨퍼런스

한국어

HyperAI초신경

Calm

평가 지표

0-shot cot

0-shot icl

1-shot icl

3-shot icl

average

basic

cn

doubt

ef

en

ignore

llm_model

manual cot

model_url

organization

parameters

release_date

robustness

std

updated_time

평가 결과

이 벤치마크에서 각 모델의 성능 결과

모델 이름	0-shot cot	0-shot icl	1-shot icl	3-shot icl	average	basic	cn	doubt	ef	en	ignore	llm_model	manual cot	model_url	organization	parameters	release_date	robustness	std	updated_time	Paper Title	Repository
API	54.5	51.9	54.1	60.0	56.8	54.4	52.6	54.6	52.1	58.4	53.9	GPT-4	75.4	https://openai.com/product/gpt-4	OpenAI	N/A	2023/3/14	83.7	9.9	2024/5/1	-	-

0 of 1 row(s) selected.