Robot Task Planning on SheetCopilot
Metrics
Pass@1
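As a rough illustration of the metric, the sketch below assumes Pass@1 means the fraction of tasks whose single generated solution passes the benchmark's check. The function name `pass_at_1` and the sample outcomes are hypothetical. They are not taken from the benchmark's evaluation code or data.

```python
from typing import Sequence


def pass_at_1(first_attempt_passed: Sequence[bool]) -> float:
    """Return the fraction of tasks solved by the first (only) attempt.

    Assumption: one solution is generated per task, and each entry
    records whether that solution passed the task's check.
    """
    if not first_attempt_passed:
        raise ValueError("need at least one task result")
    return sum(first_attempt_passed) / len(first_attempt_passed)


# Hypothetical outcomes for five tasks (illustrative, not benchmark data).
outcomes = [True, False, True, True, False]
print(f"Pass@1 = {pass_at_1(outcomes):.1%}")  # prints "Pass@1 = 60.0%"
```

Under this reading, a reported score of 61.1% would mean that roughly 61 of every 100 tasks were solved on the first attempt.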
Results
Pass@1 scores of the models evaluated on this benchmark
Comparison Table
| Model Name | Pass@1 |
|---|---|
| sheetagent-a-generalist-agent-for-spreadsheet | 61.1% |
| sheetcopilot-bringing-software-productivity | 44.3% |