Zero-Shot Video Retrieval on YouCook2
Evaluation Metrics
text-to-video Median Rank
text-to-video R@1
text-to-video R@10
text-to-video R@5
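The metrics above can be illustrated with a minimal sketch. It assumes a square similarity matrix in which text query i is paired with exactly one correct video, video i. R@K is the percentage of queries whose correct video ranks in the top K, and Median Rank is the median of those ranks (lower is better). The function names and the toy scores are hypothetical. Actual evaluation protocols for this benchmark may differ, for example in how ties are handled.

```python
from statistics import median


def retrieval_ranks(sim):
    """Return the 1-based rank of the correct video for each text query.

    Assumption: sim[i][j] is the similarity between text query i and
    video j, and the correct video for query i is video i.
    """
    ranks = []
    for i, row in enumerate(sim):
        # Rank = 1 + number of videos scored strictly higher than the
        # correct one (ties are resolved in favor of the correct video).
        ranks.append(1 + sum(1 for score in row if score > row[i]))
    return ranks


def recall_at_k(ranks, k):
    """Percentage of queries whose correct video is ranked within the top k."""
    return 100.0 * sum(1 for r in ranks if r <= k) / len(ranks)


def median_rank(ranks):
    """Median rank of the correct video across all queries."""
    return median(ranks)


# Toy scores for three text queries against three videos (made-up values).
sim = [
    [0.9, 0.1, 0.3],
    [0.8, 0.2, 0.5],
    [0.1, 0.2, 0.7],
]
ranks = retrieval_ranks(sim)
print(ranks)
print(recall_at_k(ranks, 1), recall_at_k(ranks, 5), median_rank(ranks))
```

In this toy case the second query ranks its correct video last, so R@1 is two out of three queries while the median rank stays at 1.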
Results
Performance of each model on this benchmark
Comparison Table
Model | text-to-video Median Rank | text-to-video R@1 | text-to-video R@10 | text-to-video R@5 |
---|---|---|---|---|
howtocaption-prompting-llms-to-transform | 8 | 19.7 | 53.9 | 43.6 |
video-text-modeling-with-zero-shot-transfer | - | 20.3 | 53.3 | 43.0 |
taco-token-aware-cascade-contrastive-learning | - | 19.9 | 55.7 | 43.2 |
omnivec2-a-novel-transformer-based-network | - | 26.1 | 70.8 | 54.1 |
videoclip-contrastive-pre-training-for-zero | - | 22.7 | 63.1 | 50.4 |
howtocaption-prompting-llms-to-transform | 15 | 13.4 | 44.1 | 33.1 |
vatt-transformers-for-multimodal-self | - | - | 45.5 | - |
end-to-end-learning-of-visual-representations | - | 15.1 | 51.2 | 38.0 |
multi-granularity-correspondence-learning-1 | - | 24.2 | 64.1 | 51.9 |