HyperAI超神経

Code Generation On Codecontests

評価指標

Test Set pass@1
Test Set pass@5
Val Set pass@1

評価結果

このベンチマークにおける各モデルのパフォーマンス結果

比較表
モデル名Test Set pass@1Test Set pass@5Val Set pass@1
mapcoder-multi-agent-code-generation-for28.535.228.5
planning-driven-programming-a-large-language34.7--
wizardcoder-empowering-code-large-language1.113.181.98
motcoder-elevating-large-language-models-with20.77-16.72
codesim-multi-agent-code-generation-and-129.1--
codechain-towards-modular-code-generation2.353.292.48
motcoder-elevating-large-language-models-with26.34-20.35