HyperAI

Audio-Visual Active Speaker Detection on AVA

Metrics

validation mean average precision
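As a rough illustration of what an average precision score measures, the sketch below computes non-interpolated average precision for a single binary "speaking" class. It is a minimal sketch under stated assumptions, not the official AVA evaluation procedure, which may differ in details such as interpolation. The function name `average_precision` and its inputs `scores` and `labels` are hypothetical and chosen only for this example.

```python
from typing import Sequence


def average_precision(scores: Sequence[float], labels: Sequence[int]) -> float:
    """Non-interpolated average precision for one binary class.

    scores: model confidence that each face crop is speaking (assumed input).
    labels: 1 if the ground truth is "speaking", otherwise 0 (assumed input).
    """
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")

    total_positives = sum(labels)
    if total_positives == 0:
        # No positive samples: define the score as 0.0 for this sketch.
        return 0.0

    # Rank predictions from most to least confident.
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)

    true_positives = 0
    precision_sum = 0.0
    for rank, (_, label) in enumerate(ranked, start=1):
        if label == 1:
            true_positives += 1
            # Accumulate precision at the rank of each retrieved positive.
            precision_sum += true_positives / rank

    return precision_sum / total_positives


if __name__ == "__main__":
    example_scores = [0.9, 0.8, 0.7, 0.6]
    example_labels = [1, 0, 1, 0]
    print(f"AP = {average_precision(example_scores, example_labels):.4f}")
```

A higher value means speaking faces are ranked above non-speaking ones more consistently; a perfect ranking yields 1.0.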

Results

Performance results of different models on this benchmark

| Model Name | Validation mean average precision | Paper Title | Repository |
| --- | --- | --- | --- |
| LoCoNet | 95.2% | LoCoNet: Long-Short Context Network for Active Speaker Detection | |
| MAAS-TAN | 88.8% | MAAS: Multi-modal Assignation for Active Speaker Detection | |
| SPELL | 94.2% | Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection | |
| 3D-ResNet-GRU | 84.0% | Multi-Task Learning for Audio Visual Active Speaker Detection | - |
| ASDNet | 93.5% | How to Design a Three-Stage Architecture for Audio-Visual Active Speaker Detection in the Wild | |
| LoCoNet + Laser | 95.3% | LASER: Lip Landmark Assisted Speaker Detection for Robustness | - |
| SPELL+ | 94.9% | Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection | |
| VGG-{LSTM+TCN} (ensemble) | 87.8% | Naver at ActivityNet Challenge 2019 -- Task B Active Speaker Detection (AVA) | - |
| MAAS-LAN | 85.1% | MAAS: Multi-modal Assignation for Active Speaker Detection | |
| Active Speakers in Context | 87.1% | Active Speakers in Context | - |
| LoCoNet+TalkNCE | 95.5% | TalkNCE: Improving Active Speaker Detection with Talk-Aware Contrastive Learning | |
| UniCon | 92.0% | UniCon: Unified Context Network for Robust Active Speaker Detection | - |
| GSCMIA | 92.86% | Audio-Visual Activity Guided Cross-Modal Identity Association for Active Speaker Detection | |
| VTP (visual only) | 89.2% | Sub-word Level Lip Reading With Visual Attention | - |
| SA-uncertainty Fusion | 91.9% | Active Speaker Detection as a Multi-Objective Optimization with Uncertainty-based Multimodal Fusion | - |
| EASEE-50 | 94.1% | End-to-End Active Speaker Detection | |
| Extended UniCon | 93.6% | ICTCAS-UCAS-TAL Submission to the AVA-ActiveSpeaker Task at ActivityNet Challenge 2021 | - |
| UniCon+ | 94.5% | UniCon+: ICTCAS-UCAS Submission to the AVA-ActiveSpeaker Task at ActivityNet Challenge 2022 | - |
| Light-ASD | 94.1% | A Light Weight Model for Active Speaker Detection | |
| TalkNet | 92.3% | NUS-HLT Report for ActivityNet Challenge 2021 AVA (Speaker) | |