Code Generation On Bigcodebench Complete

Pass@1

평가 결과

이 벤치마크에서 각 모델의 성능 결과

모델 이름	Pass@1	Paper Title	Repository
DeepSeek-Coder-V2-Instruct	59.7	BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions
GPT-4o-2024-05-13	61.1	BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions

0 of 2 row(s) selected.