Speaker Specific Lip To Speech Synthesis On 5

Results

Performance results of various models on this benchmark

Model Name	ESTOI	PESQ	STOI	Paper Title	Repository
Lip2Wav	0.183	1.671	0.282	Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis
Visual Voice Memory	0.402	1.612	0.576	Speech Reconstruction with Reminiscent Sound via Visual Voice Memory	-
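
As a reading aid, the short sketch below picks the top-scoring model for each metric in the table above. The scores are copied from the table. The dictionary layout and the helper name `best_per_metric` are illustrative assumptions, not part of the benchmark. It takes the maximum per metric, since higher scores are better for ESTOI, PESQ, and STOI.

```python
# Scores copied from the results table above.
results = {
    "Lip2Wav": {"ESTOI": 0.183, "PESQ": 1.671, "STOI": 0.282},
    "Visual Voice Memory": {"ESTOI": 0.402, "PESQ": 1.612, "STOI": 0.576},
}


def best_per_metric(results):
    """Return {metric: model name with the highest score on that metric}.

    Hypothetical helper for illustration; assumes higher is better.
    """
    metrics = next(iter(results.values())).keys()
    return {
        metric: max(results, key=lambda name: results[name][metric])
        for metric in metrics
    }


best = best_per_metric(results)
for metric, model in best.items():
    print(f"{metric}: {model} ({results[model][metric]})")
```

With only two rows the comparison is easy to do by eye. The same helper applies unchanged if more models are added to the dictionary.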
