HyperAI

Language Modelling on WikiText-2

Metrics

Number of params
Test perplexity
Validation perplexity
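
Perplexity is conventionally the exponential of the average per-token negative log-likelihood on the evaluated split. The following is a minimal sketch of that formula, assuming natural-log token probabilities; the function name and the sample values are hypothetical, and the individual entries on this benchmark may differ in tokenization or evaluation protocol.

```python
import math


def perplexity(token_log_probs):
    """Return exp(mean negative log-likelihood) over the given tokens.

    token_log_probs: natural-log probabilities the model assigned to each
    held-out token (hypothetical input format, chosen for illustration).
    """
    if not token_log_probs:
        raise ValueError("need at least one token")
    mean_nll = -sum(token_log_probs) / len(token_log_probs)
    return math.exp(mean_nll)


# Hypothetical log-probabilities for four held-out tokens.
log_probs = [math.log(0.25), math.log(0.5), math.log(0.125), math.log(0.25)]
ppl = perplexity(log_probs)
print(f"perplexity = {ppl:.2f}")
```

Lower values are better: a model that assigns higher probability to the held-out text gets a lower perplexity.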

Results

Performance results of various models on this benchmark

Comparison table
| Model name | Number of params | Test perplexity | Validation perplexity |
|---|---|---|---|
| improving-neural-language-modeling-via | 35M | 38.65 | 40.27 |
| direct-output-connection-for-a-high-rank | 37M | 58.03 | 60.29 |
| mogrifier-lstm | 35M | 55.1 | 57.3 |
| hydra-a-system-for-large-multi-model-deep | 1542M | 15.17 | 15.69 |
| dynamic-evaluation-of-neural-sequence-models | 33M | 44.3 | 46.4 |
| alleviating-sequence-information-loss-with | 33M | 64.73 | 67.47 |
| improving-neural-language-models-with-a | - | 99.3 | - |
| massive-language-models-can-be-accurately | - | 234.77 | - |
| language-models-are-unsupervised-multitask | 345M | 22.76 | - |
| massive-language-models-can-be-accurately | - | 8.21 | - |
| frage-frequency-agnostic-word-representation | 35M | 39.14 | 40.85 |
| gradual-learning-of-recurrent-neural-networks | 38M | 40.46 | 42.19 |
| learning-associative-inference-using-fast-1 | 37M | 61.65 | 54.48 |
| language-models-are-unsupervised-multitask | 762M | 19.93 | - |
| deep-residual-output-layers-for-neural | 34M | 61.9 | 64.9 |
| language-models-are-unsupervised-multitask | 1542M | 18.34 | - |
| regularizing-and-optimizing-lstm-language | 33M | 52.0 | 53.8 |
| on-the-state-of-the-art-of-evaluation-in | 24M | 65.9 | 69.3 |
| tying-word-vectors-and-word-classifiers-a | - | 87.7 | 92.3 |
| breaking-the-softmax-bottleneck-a-high-rank | 35M | 40.68 | 42.41 |
| regularizing-and-optimizing-lstm-language | 33M | 65.8 | 68.6 |
| breaking-the-softmax-bottleneck-a-high-rank | 35M | 61.45 | 63.88 |
| partially-shuffling-the-training-data-to-1 | 35M | 59.98 | 62.38 |
| direct-output-connection-for-a-high-rank | 185M | 53.09 | 54.19 |
| tying-word-vectors-and-word-classifiers-a | - | 87.0 | 91.5 |
| fraternal-dropout | 34M | 64.1 | 66.8 |
| partially-shuffling-the-training-data-to-1 | 37M | 57.85 | 60.16 |
| egru-event-based-gru-for-activity-sparse | - | 68.9 | - |
| language-models-are-unsupervised-multitask | 117M | 29.41 | - |
| deep-residual-output-layers-for-neural | 34M | 42.0 | 43.9 |
| improved-language-modeling-by-decoding-the | 35M | 40.3 | 42.0 |
| massive-language-models-can-be-accurately | - | 8.34 | - |
| improving-neural-language-models-with-a | - | 68.9 | - |
| massive-language-models-can-be-accurately | - | 8.73 | - |
| massive-language-models-can-be-accurately | - | 8.45 | - |
| advancing-state-of-the-art-in-language | - | 53.73 | 55.4 |
| 190409408 | 395M | 34.1 | 37.7 |
| mogrifier-lstm | 35M | 38.6 | 40.2 |
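
To compare entries, it can help to turn the parameter counts into integers and sort by test perplexity. The sketch below does this for a handful of rows copied from the table; the helper name `parse_params` and the tuple layout are assumptions made for illustration, not part of the benchmark page.

```python
def parse_params(text):
    """Convert a count such as '35M' to an integer; '-' (not reported) gives None."""
    if text == "-":
        return None
    if text.endswith("M"):
        return int(float(text[:-1]) * 1_000_000)
    return int(text)


# A few rows copied from the table: (model name, params, test perplexity).
rows = [
    ("mogrifier-lstm", "35M", 38.6),
    ("language-models-are-unsupervised-multitask", "1542M", 18.34),
    ("hydra-a-system-for-large-multi-model-deep", "1542M", 15.17),
    ("egru-event-based-gru-for-activity-sparse", "-", 68.9),
]

# Lower perplexity is better, so sort ascending on the third field.
ranked = sorted(rows, key=lambda row: row[2])
for name, params, test_ppl in ranked:
    print(f"{test_ppl:7.2f}  {parse_params(params)}  {name}")
```

Rows whose parameter count is not reported keep `None` rather than a guessed value, so they can be filtered out before any per-parameter comparison.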