Probing Language Models On Kamel
Metrics
Average F1
Results
Performance results of various models on this benchmark
Comparison Table
Model Name | Average F1 |
---|---|
kamel-knowledge-analysis-with-multitoken | 17.62 |
Performance results of various models on this benchmark
Model Name | Average F1 |
---|---|
kamel-knowledge-analysis-with-multitoken | 17.62 |