GZSL Video Classification on VGGSound-GZSL
Metrics
HM
ZSL
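
This page does not define the metrics. In the generalized zero-shot learning literature, HM conventionally denotes the harmonic mean of the accuracies on seen and unseen classes, and ZSL denotes accuracy when only unseen classes are candidates. Assuming that convention applies here, a minimal sketch of the HM computation could look like the following. The function name and the input values are hypothetical and are not taken from the leaderboard.

```python
def harmonic_mean(seen_acc: float, unseen_acc: float) -> float:
    """Harmonic mean of seen-class and unseen-class accuracy (in percent).

    Assumes the usual GZSL convention: HM = 2 * S * U / (S + U).
    """
    total = seen_acc + unseen_acc
    if total == 0:
        # Both accuracies are zero; define HM as zero to avoid dividing by zero.
        return 0.0
    return 2 * seen_acc * unseen_acc / total


# Hypothetical accuracies for illustration only.
hm = harmonic_mean(10.0, 5.0)
print(round(hm, 2))  # → 6.67
```

Because the harmonic mean is pulled toward the smaller of its two inputs, a model cannot score a high HM by doing well on seen classes alone.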
Results
Performance results of various models on this benchmark
Model name | HM | ZSL | Paper Title | Repository |
---|---|---|---|---|
Hyper-multiple | 9.32 | 7.97 | Hyperbolic Audio-visual Zero-shot Learning | - |
CJME | 6.17 | 5.16 | Coordinated Joint Multimodal Embeddings for Generalized Audio-Visual Zeroshot Classification and Retrieval of Videos | - |
AVCA | 6.31 | 6.00 | Audio-visual Generalised Zero-shot Learning with Cross-modal Attention and Language | |
KDA | 10.45 | 8.43 | Boosting Audio-visual Zero-shot Learning with Large Language Models | |
APN | 5.11 | 4.49 | Attribute Prototype Network for Any-Shot Learning | - |
AVGZSLNet | 5.83 | 5.28 | AVGZSLNet: Audio-Visual Generalized Zero-Shot Learning by Reconstructing Label Features from Multi-Modal Embeddings | - |
TCaF | 7.33 | 6.06 | Temporal and cross-modal attention for audio-visual zero-shot learning | |