HyperAI

Text-to-SQL on BIRD (Big Bench for Large Scale)

Metrics

Execution Accuracy % (Dev)
Execution Accuracy % (Test)
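
As a rough illustration of the metric, execution accuracy can be read as the percentage of questions for which the predicted SQL returns the same result as the reference SQL when both are run against the database. The sketch below is a simplified assumption of how that might be computed with SQLite, not the benchmark's official evaluation script. The function name `execution_accuracy`, the toy `singer` table, and the choice to compare results as unordered sets are all illustrative.

```python
import sqlite3


def execution_accuracy(conn, pairs):
    """Return the percentage of (predicted_sql, gold_sql) pairs whose results match.

    Assumption: results are compared as unordered sets of rows, and a
    predicted query that fails to execute counts as incorrect.
    """
    if not pairs:
        return 0.0
    correct = 0
    for predicted_sql, gold_sql in pairs:
        gold_rows = set(conn.execute(gold_sql).fetchall())
        try:
            predicted_rows = set(conn.execute(predicted_sql).fetchall())
        except sqlite3.Error:
            continue  # invalid SQL is scored as a miss
        if predicted_rows == gold_rows:
            correct += 1
    return 100.0 * correct / len(pairs)


# Toy in-memory database (hypothetical schema, for illustration only).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE singer (name TEXT, age INTEGER)")
conn.executemany(
    "INSERT INTO singer VALUES (?, ?)",
    [("Ann", 30), ("Bob", 45), ("Cy", 52)],
)

pairs = [
    # Different SQL text, same result: counted as correct.
    ("SELECT name FROM singer WHERE age > 40",
     "SELECT name FROM singer WHERE age >= 41"),
    # Returns extra rows: incorrect.
    ("SELECT name FROM singer",
     "SELECT name FROM singer WHERE age > 40"),
    # Misspelled column, fails to execute: incorrect.
    ("SELECT nme FROM singer",
     "SELECT name FROM singer"),
    # Equivalent aggregates: correct.
    ("SELECT COUNT(*) FROM singer",
     "SELECT COUNT(name) FROM singer"),
]

score = execution_accuracy(conn, pairs)
print(f"Execution accuracy: {score:.2f}%")
```

Scoring on execution results rather than on the SQL string means that two differently written but equivalent queries both count as correct, which is why the first and last pairs above are scored as matches.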

Results

Performance results of various models on this benchmark

Comparison table
| Model name | Execution Accuracy % (Dev) | Execution Accuracy % (Test) |
|---|---|---|
| Model 1 | 68.12 | 70.21 |
| msc-sql-multi-sample-critiquing-small | 65.6 | - |
| Model 3 | 59.71 | 60.71 |
| Model 4 | 58.47 | 60.37 |
| Model 5 | 62.97 | 64.51 |
| text-to-sql-empowered-by-large-language | 54.76 | 57.41 |
| can-llm-already-serve-as-a-database-interface | 37.22 | 39.30 |
| Model 8 | 55.48 | 63.39 |
| Model 9 | 72.43 | 73.17 |
| can-llm-already-serve-as-a-database-interface | - | - |
| Model 11 | 55.48 | 63.39 |
| can-llms-effectively-leverage-structural | 42.70 | 49.02 |
| Model 13 | 64.73 | 65.23 |
| Model 14 | 67.99 | 66.21 |
| Model 15 | 65.45 | 68.87 |
| Model 16 | 63.36 | 65.45 |
| can-llm-already-serve-as-a-database-interface | 34.35 | 36.47 |
| chase-sql-multi-path-reasoning-and-preference | 73.14 | 74.06 |
| xiyan-sql-a-multi-generator-ensemble | 73.34 | 75.63 |
| Model 20 | 69.3 | 72.28 |
| chess-contextual-harnessing-for-efficient-sql | 65 | 66.69 |
| can-llms-effectively-leverage-structural | 46.35 | 54.89 |
| Model 23 | 60.5 | 64.84 |
| Model 24 | 62.58 | 63.22 |
| Model 25 | 57.17 | 59.25 |
| Model 26 | 58.5 | 62.66 |
| mac-sql-multi-agent-collaboration-for-text-to | 57.56 | 59.59 |
| Model 28 | 66.82 | 64.00 |
| Model 29 | 65.38 | 67.86 |
| Model 30 | 64.62 | - |
| knowledge-to-sql-enhancing-sql-generation | 48.92 | - |
| the-death-of-schema-linking-text-to-sql-in | 67.21 | 71.83 |
| can-llm-already-serve-as-a-database-interface | 36.64 | 40.08 |
| Model 34 | 37.68 | 47.74 |
| din-sql-decomposed-in-context-learning-of-1 | 50.72 | 55.90 |
| Model 36 | 66.95 | 69.03 |
| Model 37 | 72.16 | 70.26 |
| can-llm-already-serve-as-a-database-interface | 27.38 | 33.04 |
| Model 39 | 74.32 | 74.12 |
| Model 40 | 61.34 | 64.95 |