HyperAI

Explanation Generation On Vcr

Metrics

Human Explanation Rating

Results

Performance results of various models on this benchmark

Comparison Table
Model NameHuman Explanation Rating
harnessing-the-power-of-multi-task68.9
harnessing-the-power-of-multi-task77.3