Audio-visual Generalised Zero-shot Learning with Cross-modal Attention and Language