Speaker Specific Lip To Speech Synthesis On 5
Metrics
ESTOI
PESQ
STOI
Results
Performance results of various models on this benchmark
Model Name | ESTOI | PESQ | STOI | Paper Title | Repository |
---|---|---|---|---|---|
Lip2Wav | 0.183 | 1.671 | 0.282 | Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis | |
Visual Voice Memory | 0.402 | 1.612 | 0.576 | Speech Reconstruction with Reminiscent Sound via Visual Voice Memory |
0 of 2 row(s) selected.