Speaker Specific Lip To Speech Synthesis On 4

ESTOI

PESQ

STOI

Results

Performance results of various models on this benchmark

Model Name	ESTOI	PESQ	STOI	Paper Title	Repository
Visual Voice Memory	0.337	1.366	0.504	Speech Reconstruction with Reminiscent Sound via Visual Voice Memory	-
Lip2Wav	0.311	1.29	0.446	Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis

0 of 2 row(s) selected.