Stochastic Optimization On Penn Treebank
평가 지표
Bit per Character (BPC)
평가 결과
이 벤치마크에서 각 모델의 성능 결과
모델 이름 | Bit per Character (BPC) | Paper Title | Repository |
---|---|---|---|
AdaShift | 1.274 | Domain-independent Dominance of Adaptive Methods | - |
AdaBound | 2.863 | Domain-independent Dominance of Adaptive Methods | - |
AvaGrad | 1.175 | Domain-independent Dominance of Adaptive Methods | - |
AdamW | 1.23 | Domain-independent Dominance of Adaptive Methods | - |
0 of 4 row(s) selected.