Audio Generation On Classical Music 5 Seconds

Bits per byte

Evaluation Results

Performance of each model on this benchmark

Model Name	Bits per byte	Paper Title	Repository
Sparse Transformer 152M (strided)	1.97	Generating Long Sequences with Sparse Transformers
VAB-Encodec (Ours)	40	From Vision to Audio and Beyond: A Unified Model for Audio-Visual Representation and Generation
