WildDESED: An LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection System

Yang Xiao, Rohan Kumar Das

Abstract

This work aims to advance sound event detection (SED) research by presenting a new large language model (LLM)-powered dataset, namely wild domestic environment sound event detection (WildDESED). It is crafted as an extension to the original DESED dataset to reflect diverse acoustic variability and complex noises in home settings. We leveraged LLMs to generate eight different domestic scenarios based on the target sound categories of the DESED dataset. We then enriched the scenarios with a carefully tailored mixture of noises selected from AudioSet, ensuring no overlap with the target sounds. We use the widely popular convolutional neural recurrent network to study the WildDESED dataset, and its performance shows how challenging the dataset is. We then apply curriculum learning, gradually increasing noise complexity, to enhance the model's generalization capabilities across various noise levels. Our results with this approach show improvements in noisy environments, validating its effectiveness on the WildDESED dataset and promoting noise-robust SED advancements.
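The curriculum idea in the abstract, training on easier (less noisy) mixtures first and introducing harder ones later, can be illustrated with a minimal sketch. The abstract does not specify how noise levels are defined or scheduled, so everything here is an assumption for illustration: noise difficulty is expressed as a signal-to-noise ratio (SNR) in dB, the stage values `(10, 5, 0, -5)` are made up, and the helper names `mix_at_snr` and `curriculum_snrs` are hypothetical, not the authors' code.

```python
import numpy as np


def mix_at_snr(clean, noise, snr_db):
    """Scale `noise` so the clean-to-noise power ratio equals `snr_db`, then add it.

    Assumption: both inputs are 1-D float arrays of the same length.
    """
    clean_power = np.mean(clean ** 2)
    noise_power = np.mean(noise ** 2)
    # Noise power implied by the requested SNR (in dB).
    target_noise_power = clean_power / (10 ** (snr_db / 10))
    gain = np.sqrt(target_noise_power / noise_power)
    return clean + gain * noise


def curriculum_snrs(epoch, total_epochs, stages=(10.0, 5.0, 0.0, -5.0)):
    """Return the SNR levels available at `epoch`.

    Stages are ordered from easy (high SNR) to hard (low SNR); harder
    stages are unlocked as training progresses. The stage values are
    illustrative, not taken from the paper.
    """
    n_unlocked = 1 + (epoch * len(stages)) // total_epochs
    return stages[:min(n_unlocked, len(stages))]


# Usage: at each epoch, sample an SNR from the unlocked stages and mix.
rng = np.random.default_rng(0)
clean = rng.standard_normal(16000)   # stand-in for a 1 s clip at 16 kHz
noise = rng.standard_normal(16000)   # stand-in for a background noise clip
for epoch in range(8):
    snr_db = rng.choice(curriculum_snrs(epoch, total_epochs=8))
    noisy = mix_at_snr(clean, noise, snr_db)
```

In this sketch earlier stages stay available once harder ones unlock, so the model keeps seeing cleaner examples. A schedule that replaces stages instead of accumulating them would be an equally plausible reading of "gradually increasing noise complexity".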

