Speaker Specific Lip To Speech Synthesis On 4
Metrics
ESTOI
PESQ
STOI
Results
Performance results of various models on this benchmark
Model Name | ESTOI | PESQ | STOI | Paper Title | Repository |
---|---|---|---|---|---|
Visual Voice Memory | 0.337 | 1.366 | 0.504 | Speech Reconstruction with Reminiscent Sound via Visual Voice Memory | |
Lip2Wav | 0.311 | 1.29 | 0.446 | Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis |
0 of 2 row(s) selected.