HyperAI超神経

Language Modelling On Text8

評価指標

Bit per Character (BPC)

評価結果

このベンチマークにおける各モデルのパフォーマンス結果

比較表
モデル名Bit per Character (BPC)
long-short-transformer-efficient-transformers1.09
2305-149520.98
language-models-are-unsupervised-multitask0.98
augmenting-self-attention-with-persistent1.08
adaptive-attention-span-in-transformers1.11
bp-transformer-modelling-long-range-context1.11
character-level-language-modeling-with-deeper1.18
augmenting-self-attention-with-persistent1.11
architectural-complexity-measures-of1.63
dynamic-evaluation-of-neural-sequence-models1.19
multiplicative-lstm-for-sequence-modelling1.27
hierarchical-multiscale-recurrent-neural1.29
architectural-complexity-measures-of1.49
recurrent-highway-networks-with-grouped1.157
multiplicative-lstm-for-sequence-modelling1.40
dynamic-evaluation-of-transformer-language1.038
discrete-flows-invertible-generative-models1.23
bayesian-flow-networks1.41
adaptive-attention-span-in-transformers1.07
character-level-language-modeling-with-deeper1.13
recurrent-highway-networks1.27
transformer-xl-attentive-language-models1.08
recurrent-batch-normalization1.36
pay-attention-when-required1.18