Command Palette
Search for a command to run...
Code Generation On Webapp1K Duo React
Métriques
pass@1
Résultats
Résultats de performance de divers modèles sur ce benchmark
| Paper Title | ||
|---|---|---|
| claude-3-5-sonnet | 0.679 | A Case Study of Web App Coding with OpenAI Reasoning Models |
| o1-mini | 0.667 | A Case Study of Web App Coding with OpenAI Reasoning Models |
| o1-preview | 0.652 | A Case Study of Web App Coding with OpenAI Reasoning Models |
| gpt-4o-2024-08-06 | 0.531 | A Case Study of Web App Coding with OpenAI Reasoning Models |
| deepseek-v2.5 | 0.49 | A Case Study of Web App Coding with OpenAI Reasoning Models |
| mistral-large-2 | 0.449 | A Case Study of Web App Coding with OpenAI Reasoning Models |
0 of 6 row(s) selected.