X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval