LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment