HyperAI

Code Generation On Codecontests

Metrics

Test Set pass@1
Test Set pass@5
Val Set pass@1

Results

Performance results of various models on this benchmark

Comparison Table
Model NameTest Set pass@1Test Set pass@5Val Set pass@1
mapcoder-multi-agent-code-generation-for28.535.228.5
planning-driven-programming-a-large-language34.7--
wizardcoder-empowering-code-large-language1.113.181.98
motcoder-elevating-large-language-models-with20.77-16.72
codesim-multi-agent-code-generation-and-129.1--
codechain-towards-modular-code-generation2.353.292.48
motcoder-elevating-large-language-models-with26.34-20.35