Language Modelling On C4
Metrics
Perplexity
Steps
TPUv3 Hours
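As a reminder of what the headline metric measures, the following is a minimal sketch of the standard definition of perplexity: the exponential of the average negative log-likelihood per token. The function name `perplexity` and the example probabilities are hypothetical and chosen only for illustration. The leaderboard does not state the tokenization or normalization unit each paper used, so this is not necessarily how any listed result was computed.

```python
import math


def perplexity(token_log_probs):
    """Return perplexity given per-token natural-log probabilities.

    Assumption: log-probabilities use the natural logarithm and the
    average is taken per token; papers may normalize differently.
    """
    if not token_log_probs:
        raise ValueError("need at least one token log-probability")
    # Average negative log-likelihood over the sequence.
    avg_nll = -sum(token_log_probs) / len(token_log_probs)
    return math.exp(avg_nll)


# Hypothetical example: a model that assigns probability 0.25 to each
# of four tokens is, on average, as uncertain as a uniform choice
# among four options.
example_log_probs = [math.log(0.25)] * 4
print(perplexity(example_log_probs))
```

Lower values mean the model assigns higher probability to the held-out text. Values are only comparable between entries when the tokenization and evaluation split match.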
Results
Reported results for models evaluated on this benchmark. Lower perplexity is better; a dash (-) marks a value that is not listed.
Comparison Table
Model (paper slug) | Perplexity | Steps | TPUv3 Hours |
---|---|---|---|
primer-searching-for-efficient-transformers | 12.69 | 1M | 16.5K |
llm-int8-8-bit-matrix-multiplication-for | 12.45 | - | - |
primer-searching-for-efficient-transformers | 13.25 | 1M | 15.7K |
n-grammer-augmenting-transformers-with-latent-1 | 14.79 | - | - |
llm-int8-8-bit-matrix-multiplication-for | 14.43 | - | - |
llm-int8-8-bit-matrix-multiplication-for | 15.91 | - | - |
primer-searching-for-efficient-transformers | 12.35 | 1M | 17.3K |
n-grammer-augmenting-transformers-with-latent-1 | 15.01 | - | - |
llm-int8-8-bit-matrix-multiplication-for | 13.3 | - | - |