Performance results of various models on this benchmark
Metrics
Inst-level loose-accuracy
Inst-level strict-accuracy
Prompt-level loose-accuracy
Prompt-level strict-accuracy