Search for a command to run...
Leveraging Unimodal Self-Supervised Learning for Multimodal Audio-Visual Speech Recognition