Date

8 months ago

Size

389.35 GB

Organization

Paper URL

2502.05674

License

Apache 2.0

Tags

Text-to-Speech

Audio and Speech Processing

Audio Recognition

Synthesis

ShiftySpeech is a large-scale synthetic speech detection benchmark released by Johns Hopkins University in 2025. The related paper is titled "ShiftySpeech: A Large-Scale Synthetic Speech Dataset with Distribution ShiftsThe aim is to study the generalization ability of speech synthesis detection models in the real world when faced with "distribution drift" (including changes in language, speaker, generation model, and recording conditions). This dataset contains over 3,000 hours of synthesized speech, covering seven source domains, including reading styles, podcasts, YouTube recordings, and other scenarios with background noise or non-standard recording conditions, as well as variations in language, speaker age, accent, and gender. The data covers three languages (English, Chinese, and Japanese), and speech was generated using six TTS (text-to-speech) systems and twelve vocoders (vocoders/waveform generators) to construct different degrees of system distribution drift.

ShiftySpeech.torrent

Seeding 1Downloading 0Completed 2Total Downloads 94

ShiftySpeech/
- README.md
  1.6 KB
- README.txt
  3.2 KB

This dataset is contributed by community users and is intended for educational and informational purposes only. If any content involves copyright infringement, please contact us at [email protected] for prompt review and removal.

Related Datasets

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

HyperAI

Use this Dataset

Discuss on Discord

Date

8 months ago

Size

389.35 GB

Organization

Paper URL

2502.05674

License

Apache 2.0

Related Datasets

Groundsource Global Flood Events Dataset

3 months ago

CHIMERA General Inference Synthetic Dataset

4 months ago

THINGS-EEG EEG Dataset

5 months ago

THINGS-MEG Magnetoencephalography Dataset

5 months ago

THINGS-fMRI Functional Magnetic Resonance Imaging Dataset

5 months ago

RubricHub_v1 Multi-Domain Generative Task Dataset

5 months ago

X-ray Contraband Detection Dataset

6 months ago

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

ShiftySpeech Speech Distribution Evaluation Dataset

Build AI with AI

HyperAI Newsletters

Command Palette

ShiftySpeech Speech Distribution Evaluation Dataset

Related Datasets

Groundsource Global Flood Events Dataset

CHIMERA General Inference Synthetic Dataset

THINGS-EEG EEG Dataset

THINGS-MEG Magnetoencephalography Dataset

THINGS-fMRI Functional Magnetic Resonance Imaging Dataset

RubricHub_v1 Multi-Domain Generative Task Dataset

X-ray Contraband Detection Dataset

Build AI with AI

HyperAI Newsletters

Command Palette

ShiftySpeech Speech Distribution Evaluation Dataset

Related Datasets

Groundsource Global Flood Events Dataset

CHIMERA General Inference Synthetic Dataset

THINGS-EEG EEG Dataset

THINGS-MEG Magnetoencephalography Dataset

THINGS-fMRI Functional Magnetic Resonance Imaging Dataset

RubricHub_v1 Multi-Domain Generative Task Dataset

X-ray Contraband Detection Dataset

Build AI with AI

HyperAI Newsletters

Related Datasets

Groundsource Global Flood Events Dataset

CHIMERA General Inference Synthetic Dataset

THINGS-EEG EEG Dataset

THINGS-MEG Magnetoencephalography Dataset

THINGS-fMRI Functional Magnetic Resonance Imaging Dataset

RubricHub_v1 Multi-Domain Generative Task Dataset

X-ray Contraband Detection Dataset

Related Datasets

Groundsource Global Flood Events Dataset

CHIMERA General Inference Synthetic Dataset

THINGS-EEG EEG Dataset

THINGS-MEG Magnetoencephalography Dataset

THINGS-fMRI Functional Magnetic Resonance Imaging Dataset

RubricHub_v1 Multi-Domain Generative Task Dataset

X-ray Contraband Detection Dataset