Command Palette
Search for a command to run...
Robot Task Planning On Sheetcopilot
評価指標
Pass@1
評価結果
このベンチマークにおける各モデルのパフォーマンス結果
| Paper Title | ||
|---|---|---|
| SheetAgent (GPT-3.5) | 61.1% | SheetAgent: Towards A Generalist Agent for Spreadsheet Reasoning and Manipulation via Large Language Models |
| SheetCopilot (NIPS2023) | 44.3% | SheetCopilot: Bringing Software Productivity to the Next Level through Large Language Models |
0 of 2 row(s) selected.