Explanation Generation On Vcr
Metrics
Human Explanation Rating
Results
Performance results of various models on this benchmark
Comparison Table
Model Name | Human Explanation Rating |
---|---|
harnessing-the-power-of-multi-task | 68.9 |
harnessing-the-power-of-multi-task | 77.3 |