HyperAI초신경

Code Generation On Codecontests

평가 지표

Test Set pass@1
Test Set pass@5
Val Set pass@1

평가 결과

이 벤치마크에서 각 모델의 성능 결과

비교 표
모델 이름Test Set pass@1Test Set pass@5Val Set pass@1
mapcoder-multi-agent-code-generation-for28.535.228.5
planning-driven-programming-a-large-language34.7--
wizardcoder-empowering-code-large-language1.113.181.98
motcoder-elevating-large-language-models-with20.77-16.72
codesim-multi-agent-code-generation-and-129.1--
codechain-towards-modular-code-generation2.353.292.48
motcoder-elevating-large-language-models-with26.34-20.35