Mmsql Performance On Mmsql
评估指标
TDEX
评测结果
各个模型在此基准测试上的表现结果
模型名称 | TDEX | Paper Title | Repository |
---|---|---|---|
SQLCoder-8B | 30.7 | Evaluating and Enhancing LLMs for Multi-turn Text-to-SQL with Multiple Question Types | - |
Gemini-1.5 Flash | 65.8 | Evaluating and Enhancing LLMs for Multi-turn Text-to-SQL with Multiple Question Types | - |
Llama3-8B | 64.0 | Evaluating and Enhancing LLMs for Multi-turn Text-to-SQL with Multiple Question Types | - |
GPT-4 Turbo | 67.0 | Evaluating and Enhancing LLMs for Multi-turn Text-to-SQL with Multiple Question Types | - |
Llama3-70B | 62.8 | Evaluating and Enhancing LLMs for Multi-turn Text-to-SQL with Multiple Question Types | - |
GPT-3.5 Turbo | 64.1 | Evaluating and Enhancing LLMs for Multi-turn Text-to-SQL with Multiple Question Types | - |
0 of 6 row(s) selected.