HyperAI초신경

Speech Synthesis On Libritts

평가 지표

M-STFT
MCD
PESQ
Periodicity
V/UV F1

평가 결과

이 벤치마크에서 각 모델의 성능 결과

비교 표
모델 이름M-STFTMCDPESQPeriodicityV/UV F1
bigvsan-enhancing-gan-based-neural-vocoders-10.78810.33814.1160.09350.9635
bigvgan-a-universal-neural-vocoder-with-large0.70260.29034.3620.05930.9793
periodwave-multi-period-flow-matching-for1.0269-4.2480.07650.9651
rfwave-multi-band-rectified-flow-for-audio--4.2280.0900.968
waveglow-a-flow-based-generative-network-for1.30992.35913.1380.14850.9378
vocos-closing-the-gap-between-time-domain-and--3.700.1010.9582
waveflow-a-compact-flow-based-model-for-raw-11.11201.24553.0270.14160.9410
bigvsan-enhancing-gan-based-neural-vocoders-10.79920.41294.1200.09240.9644
bigvgan-a-universal-neural-vocoder-with-large0.79970.37454.0270.10180.9598
speaker-conditional-wavernn-towards-universal2.23581.88541.7010.30440.8144
accelerating-high-fidelity-waveform0.7358-4.4540.05280.9756
eva-gan-enhanced-various-audio-generation-via0.7982-4.35360.07510.9745
bigvgan-a-universal-neural-vocoder-with-large0.87880.45643.5190.12870.9459
hifi-gan-generative-adversarial-networks-for1.00170.66032.947 0.15650.9300
eva-gan-enhanced-various-audio-generation-via0.9485-4.03300.09420.9658