HyperAI

Audio-Visual Active Speaker Detection on VPCD

Metrics

mean average precision
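
The page names the metric but does not say how it is computed. As a rough illustration only, the sketch below uses one common definition: non-interpolated average precision (the mean of the precision values at each correctly ranked positive), averaged over groups. The function names `average_precision` and `mean_average_precision` are hypothetical, and the choice of what forms a group (for example a class or a video) is an assumption; the benchmark's official evaluation protocol may differ.

```python
def average_precision(scores, labels):
    """Non-interpolated AP: mean precision at the rank of each positive.

    scores: predicted "is speaking" confidences (higher = more confident).
    labels: ground-truth labels, 1 for active speaker, 0 otherwise.
    """
    # Rank predictions from most to least confident.
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    hits = 0
    precisions = []
    for rank, i in enumerate(order, start=1):
        if labels[i] == 1:
            hits += 1
            precisions.append(hits / rank)  # precision at this recall point
    # Assumption: a group with no positives contributes an AP of 0.
    return sum(precisions) / len(precisions) if precisions else 0.0


def mean_average_precision(groups):
    """Average the AP over groups; each group is a (scores, labels) pair."""
    aps = [average_precision(s, l) for s, l in groups]
    return sum(aps) / len(aps)


if __name__ == "__main__":
    # Toy data, not taken from the benchmark.
    groups = [
        ([0.9, 0.8, 0.7, 0.6], [1, 0, 1, 0]),
        ([0.2, 0.9], [1, 0]),
    ]
    print(mean_average_precision(groups))
```

This returns a value between 0 and 1; leaderboards often report such values multiplied by 100.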

Results

Performance results of various models on this benchmark

Comparison Table
Model Name | mean average precision
audio-visual-activity-guided-cross-modal | 83.90