Language Modelling On C4
评估指标
Perplexity
Steps
TPUv3 Hours
评测结果
各个模型在此基准测试上的表现结果
比较表格
模型名称 | Perplexity | Steps | TPUv3 Hours |
---|---|---|---|
primer-searching-for-efficient-transformers | 12.69 | 1M | 16.5K |
llm-int8-8-bit-matrix-multiplication-for | 12.45 | - | - |
primer-searching-for-efficient-transformers | 13.25 | 1M | 15.7K |
n-grammer-augmenting-transformers-with-latent-1 | 14.79 | - | - |
llm-int8-8-bit-matrix-multiplication-for | 14.43 | - | - |
llm-int8-8-bit-matrix-multiplication-for | 15.91 | - | - |
primer-searching-for-efficient-transformers | 12.35 | 1M | 17.3K |
n-grammer-augmenting-transformers-with-latent-1 | 15.01 | - | - |
llm-int8-8-bit-matrix-multiplication-for | 13.3 | - | - |