HyperAI超神経

Code Generation On Turbulence

Evaluation Metric

CorrSc

Results

Performance of each model on this benchmark

Model	CorrSc	Paper Title	Repository
GPT-3.5-Turbo	0.617	Turbulence: Systematically and Automatically Testing Instruction-Tuned Large Language Models for Code
CodeLlama:13B-4bit-quantised	0.327	Turbulence: Systematically and Automatically Testing Instruction-Tuned Large Language Models for Code
GPT-4	0.848	Turbulence: Systematically and Automatically Testing Instruction-Tuned Large Language Models for Code
Command	0.063	Turbulence: Systematically and Automatically Testing Instruction-Tuned Large Language Models for Code
CodeLlama:7B-4bit-quantised	0.289	Turbulence: Systematically and Automatically Testing Instruction-Tuned Large Language Models for Code
