HyperAI

Question Answering On Narrativeqa

Metriken

BLEU-1
BLEU-4
METEOR
Rouge-L

Ergebnisse

Leistungsergebnisse verschiedener Modelle zu diesem Benchmark

Vergleichstabelle
ModellnameBLEU-1BLEU-4METEORRouge-L
multi-style-generative-reading-comprehension48.720.9821.9554.74
commonsense-for-generative-multi-hop-question43.6321.0719.0344.16
densely-connected-attention-propagation-for44.3527.6121.8044.69
a-discrete-hard-em-approach-for-weakly---58.8
distilling-knowledge-from-reader-to-retriever-135.37.511.132
cut-to-the-chase-a-context-zoom-in-network42.7622.4919.2446.67
multi-style-generative-reading-comprehension54.1130.4326.1359.87
the-narrativeqa-reading-comprehension54.60/55.5526.71/27.78--
multi-granular-sequence-encoding-via-dilated36.5519.7917.8741.44
bidirectional-attention-flow-for-machine33.4515.6915.6836.74