Language Modelling On C4
평가 지표
Perplexity
Steps
TPUv3 Hours
평가 결과
이 벤치마크에서 각 모델의 성능 결과
비교 표
모델 이름 | Perplexity | Steps | TPUv3 Hours |
---|---|---|---|
primer-searching-for-efficient-transformers | 12.69 | 1M | 16.5K |
llm-int8-8-bit-matrix-multiplication-for | 12.45 | - | - |
primer-searching-for-efficient-transformers | 13.25 | 1M | 15.7K |
n-grammer-augmenting-transformers-with-latent-1 | 14.79 | - | - |
llm-int8-8-bit-matrix-multiplication-for | 14.43 | - | - |
llm-int8-8-bit-matrix-multiplication-for | 15.91 | - | - |
primer-searching-for-efficient-transformers | 12.35 | 1M | 17.3K |
n-grammer-augmenting-transformers-with-latent-1 | 15.01 | - | - |
llm-int8-8-bit-matrix-multiplication-for | 13.3 | - | - |