AVSD Audio-Visual Scene Aware Dialogue Dataset
Date
3 years ago
Publish URL
Categories

AVSD stands for The Audio Visual Scene-Aware Dialog (or DSTC7 Track 3) is an audio-visual dataset for understanding dialogue. The dataset aims to build a system and respond to the dialogue in the input video.