HyperAI

Fact Selection On Argscichat

Metrics

Fact-F1

Results

Performance results of various models on this benchmark

Comparison Table
Model Name                                Fact-F1
argscichat-a-dataset-for-argumentative    16.22
argscichat-a-dataset-for-argumentative    13.65
argscichat-a-dataset-for-argumentative    10.58
argscichat-a-dataset-for-argumentative    8.50