Legal Reasoning On Legalbench Rule Recall
Metrics
Balanced Accuracy
Results
Performance results of various models on this benchmark
Model Name | Balanced Accuracy | Paper Title | Repository |
---|---|---|---|
GPT-4 | 59.2 | GPT-4 Technical Report |
0 of 1 row(s) selected.
Performance results of various models on this benchmark
Model Name | Balanced Accuracy | Paper Title | Repository |
---|---|---|---|
GPT-4 | 59.2 | GPT-4 Technical Report |