HyperAI超神経

Language Modelling On C4

Evaluation Metrics

Perplexity
Steps
TPUv3 Hours

Evaluation Results

Performance results of each model on this benchmark

Comparison Table
Model name                                       Perplexity  Steps  TPUv3 Hours
primer-searching-for-efficient-transformers      12.69       1M     16.5K
llm-int8-8-bit-matrix-multiplication-for         12.45       -      -
primer-searching-for-efficient-transformers      13.25       1M     15.7K
n-grammer-augmenting-transformers-with-latent-1  14.79       -      -
llm-int8-8-bit-matrix-multiplication-for         14.43       -      -
llm-int8-8-bit-matrix-multiplication-for         15.91       -      -
primer-searching-for-efficient-transformers      12.35       1M     17.3K
n-grammer-augmenting-transformers-with-latent-1  15.01       -      -
llm-int8-8-bit-matrix-multiplication-for         13.3        -      -