Text-to-SQL on Spider 2.0
Metrics
Success Rate
Results
Performance results of various models on this benchmark
Model Name | Success Rate | Paper Title | Repository |
---|---|---|---|
Spider-Agent + Claude-3.5-Sonnet | 9.02 | Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows | - |
Spider-Agent + GPT-4o | 10.13 | Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows | - |
Spider-Agent + DeepSeek-V2.5 | 5.22 | Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows | - |
Spider-Agent + Qwen2.5-72B | 6.17 | Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows | - |
Spider-Agent + GPT-4 | 8.86 | Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows | - |
Spider-Agent + Gemini-Pro-1.5 | 2.53 | Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows | - |
Spider-Agent + Llama-3.1-405B | 2.21 | Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows | - |
Spider-Agent + o1-preview | 17.03 | Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows | - |