HyperAI

Target Sound Extraction

Target Sound Extraction is the task of extracting specific sound categories from audio mixture signals. This task aims to separate designated foreground sound components from complex audio environments containing background noise. The input form to the model can be a string, integer, or one-hot encoding of sound categories. This technique has significant application value in fields such as speech recognition, music processing, and environmental acoustic monitoring.