Code Generation on DSEval-LeetCode

Evaluation Metrics

Pass Rate
w/o Intact
w/o PE

Evaluation Results

Performance of each model on this benchmark

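| Model | Pass Rate | w/o Intact | w/o PE | Paper Title | Repository |
| --- | --- | --- | --- | --- | --- |
| CoML | 42.5 | 42.5 | 62.5 | MLCopilot: Unleashing the Power of Large Language Models in Solving Machine Learning Tasks | - |
| Code Interpreter API | 45.0 | 45.0 | 55.0 | - | - |
| ChatDev | 32.5 | 32.5 | 50.0 | - | - |
| Chapyter | 45.0 | 45.0 | 60.0 | - | - |
| Jupyter-AI | 57.5 | 57.5 | 70.0 | - | - |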