Date

4 years ago

Organization

Publish URL

www.robots.ox.ac.uk

Paper URL

arxiv.org

License

Other

Tags

Audio and Speech Processing

Object Detection

Video Processing

VGG-SS, short for VGG Sound Source, is a video dataset for evaluating sound source localization. The dataset contains more than 200 categories, 5,000 videos, and new annotations for the VGG-Sound dataset, which is 10 times larger than existing datasets. The visible sound sources in each video clip are clearly marked with bounding boxes. Unlike Flickr SoundNet, the sound source localization of this dataset is based on videos.

This dataset is contributed by community users and is intended for educational and informational purposes only. If any content involves copyright infringement, please contact us at [email protected] for prompt review and removal.

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

HyperAI

Discuss on Discord

Date

4 years ago

Organization

Publish URL

www.robots.ox.ac.uk

Paper URL

arxiv.org

License

Other

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

VGG-SS Sound Source Localization Dataset

Build AI with AI

HyperAI Newsletters

Command Palette

VGG-SS Sound Source Localization Dataset

Build AI with AI

HyperAI Newsletters

Command Palette

VGG-SS Sound Source Localization Dataset

Build AI with AI

HyperAI Newsletters