Command Palette
Search for a command to run...
Performance results of various models on this benchmark
Metrics
Execution Accuracy % (Dev)
Execution Accuracy % (Test)
40 行 总计
Search for a command to run...
Performance results of various models on this benchmark