Date

2 years ago

Size

120.7 MB

Organization

Publish URL

github.com

Paper URL

openreview.net

License

CC BY 4.0

Tags

Audio Recognition

Audio Classification

The dataset was released in 2024 by researchers from Northwestern Polytechnical University, Xi'an Lianfeng Acoustic Technology Co., Ltd., Nanyang Technological University, University of Surrey, and the Institute of Acoustics, Chinese Academy of Sciences.AudioSetCaps: Enriched Audio Captioning Dataset Generation Using Large Audio Language Models", has been accepted by NeurIPS 24. AudioSetCaps is an audio-caption dataset containing 6,117,099 10-second audio files. Each audio file is accompanied by a descriptive title and 3 Q&A pairs as metadata for generating the final caption (a total of 18,414,789 pairs of Q&A data). It is created using an automated generation pipeline of large audio and language models using data from three audio datasets: AudioSet, YouTube-8M, and VGGSound.

AudioSetCaps.torrent

Seeding 2Downloading 0Completed 125Total Downloads 258

AudioSetCaps/
- README.md
  1.63 KB
- README.txt
  3.27 KB

This dataset is contributed by community users and is intended for educational and informational purposes only. If any content involves copyright infringement, please contact us at [email protected] for prompt review and removal.

Related Datasets

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

HyperAI

Use this Dataset

Discuss on Discord

Date

2 years ago

Size

120.7 MB

Organization

Publish URL

github.com

Paper URL

openreview.net

License

CC BY 4.0

Related Datasets

Groundsource Global Flood Events Dataset

3 months ago

RubricHub_v1 Multi-Domain Generative Task Dataset

5 months ago

RoVid-X Robot Video Generation Dataset

2 months ago

LightOnOCR-mix-0126 Text Transcription Dataset

5 months ago

TxT360-3efforts Multi-Task Inference Dataset

6 months ago

X-ray Contraband Detection Dataset

6 months ago

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

AudioSetCaps Audio Subtitle Dataset

Build AI with AI

HyperAI Newsletters

Command Palette

AudioSetCaps Audio Subtitle Dataset

Related Datasets

Groundsource Global Flood Events Dataset

RubricHub_v1 Multi-Domain Generative Task Dataset

RoVid-X Robot Video Generation Dataset

LightOnOCR-mix-0126 Text Transcription Dataset

TxT360-3efforts Multi-Task Inference Dataset

X-ray Contraband Detection Dataset

Build AI with AI

HyperAI Newsletters

Command Palette

AudioSetCaps Audio Subtitle Dataset

Related Datasets

Groundsource Global Flood Events Dataset

RubricHub_v1 Multi-Domain Generative Task Dataset

RoVid-X Robot Video Generation Dataset

LightOnOCR-mix-0126 Text Transcription Dataset

TxT360-3efforts Multi-Task Inference Dataset

X-ray Contraband Detection Dataset

Build AI with AI

HyperAI Newsletters

Related Datasets

Groundsource Global Flood Events Dataset

RubricHub_v1 Multi-Domain Generative Task Dataset

RoVid-X Robot Video Generation Dataset

LightOnOCR-mix-0126 Text Transcription Dataset

TxT360-3efforts Multi-Task Inference Dataset

X-ray Contraband Detection Dataset

Related Datasets

Groundsource Global Flood Events Dataset

RubricHub_v1 Multi-Domain Generative Task Dataset

RoVid-X Robot Video Generation Dataset

LightOnOCR-mix-0126 Text Transcription Dataset

TxT360-3efforts Multi-Task Inference Dataset

X-ray Contraband Detection Dataset