HyperAI超神经

DebugBench

Evaluation Metrics

llm_model
logic_condition error
logic_operation error
logic_other error
logic_variable error
model_url
multiple_double bugs
multiple_quadraple bugs
multiple_triple bugs
organization
parameters
reference_faulty indexing
referenceillegal keywords
referenceundefined methods
referenceundefined objects
release_date
syntax_illegal comment
syntax_illegal indentation
syntax_illegal separation
syntax_missing colons
syntax_misused ==/=
syntax_unclosed parentheses
syntax_unclosed string
updated_time

Evaluation Results

Performance of each model on this benchmark

Model name: API
llm_model: CodeLlama-7b-Instruct
logic_condition error, logic_operation error, logic_other error, logic_variable error: 13.58.3810
model_url: https://huggingface.co/codellama/CodeLlama-7b-Instruct-hf
multiple_double bugs, multiple_quadraple bugs, multiple_triple bugs: 3.356.7
organization: Meta
parameters: 7B
reference_faulty indexing, referenceillegal keywords, referenceundefined methods, referenceundefined objects: 27.258.11521.9
release_date: 2023.8.25
syntax_illegal comment, syntax_illegal indentation, syntax_illegal separation, syntax_missing colons, syntax_misused ==/=, syntax_unclosed parentheses, syntax_unclosed string: 31.54.47.423.318.227.128.8
updated_time: 2024.8.11
Paper Title: -
Repository: -