Audio
Performance metrics of mainstream AI models across various tasks, showcasing the state-of-the-art technology
AI Model Performance Benchmarks
Target Speaker Extraction
49 papers | 0 benchmarks
Inference Optimization
48 papers | 0 benchmarks
Room Impulse Response (RIR)
46 papers | 0 benchmarks
Bandwidth Extension
45 papers | 6 benchmarks
Audio Classification
44 papers | 26 benchmarks
Robust Speech Recognition
44 papers | 0 benchmarks
Sound Source Localization
44 papers | 0 benchmarks
Voice Cloning
44 papers | 0 benchmarks
Environmental Sound Classification
43 papers | 3 benchmarks
Sound Event Detection
41 papers | 5 benchmarks
Voice Conversion
41 papers | 3 benchmarks
Audio Compression
38 papers | 0 benchmarks
Direction of Arrival Estimation
38 papers | 1 benchmark
Speech Synthesis
37 papers | 5 benchmarks
Audio Generation
36 papers | 3 benchmarks
Text-to-Music Generation
36 papers | 2 benchmarks
Instrument Recognition
35 papers | 3 benchmarks
Language Identification
33 papers | 6 benchmarks
Sound Classification
31 papers | 0 benchmarks
Acoustic Scene Classification
30 papers | 5 benchmarks
Audio Tagging
25 papers | 1 benchmark
Audio-Visual Synchronization
25 papers | 0 benchmarks
Synthetic Speech Detection
25 papers | 0 benchmarks
Audio Super-Resolution
22 papers | 4 benchmarks
Voice Anti-spoofing
22 papers | 3 benchmarks
Chord Recognition
21 papers | 0 benchmarks
Pitch Control
21 papers | 0 benchmarks
Audio Denoising
19 papers | 3 benchmarks
Packet Loss Concealment
19 papers | 0 benchmarks
Beat Tracking
18 papers | 15 benchmarks
Audio Inpainting
17 papers | 0 benchmarks
Audio Signal Processing
17 papers | 0 benchmarks
Audio Quality Assessment
15 papers | 1 benchmark
Target Sound Extraction
15 papers | 3 benchmarks
Audio Captioning
11 papers | 2 benchmarks
Downbeat Tracking
11 papers | 13 benchmarks
Sound Event Localization and Detection
11 papers | 5 benchmarks
Video-to-Sound Generation
9 papers | 1 benchmark
Audio Source Separation
8 papers | 2 benchmarks
Few-Shot Audio Classification
8 papers | 10 benchmarks
Vowel Classification
7 papers | 0 benchmarks
Zero-Shot Multi-Speaker TTS
7 papers | 0 benchmarks
Audio Declipping
6 papers | 0 benchmarks
Bird Audio Detection
6 papers | 0 benchmarks
Visually Guided Sound Source Separation
6 papers | 0 benchmarks
Zero-shot Audio Captioning
6 papers | 2 benchmarks
Blind Source Separation
5 papers | 0 benchmarks
Music Source Separation
5 papers | 3 benchmarks
Retrieval-augmented Few-shot In-context Audio Captioning
5 papers | 1 benchmark
Active Speaker Localization
4 papers | 1 benchmark
Audio Effects Modeling
4 papers | 0 benchmarks
Audio Fingerprint
4 papers | 0 benchmarks
Music Compression
4 papers | 0 benchmarks
Soundscape Evaluation
4 papers | 0 benchmarks
Acoustic Novelty Detection
3 papers | 1 benchmark
Audio Dequantization
3 papers | 0 benchmarks
Audio Signal Recognition
3 papers | 0 benchmarks
Gunshot Detection
3 papers | 0 benchmarks
Lung Sound Classification
3 papers | 1 benchmark
Music Genre Transfer
3 papers | 0 benchmarks
Underwater Acoustic Classification
3 papers | 0 benchmarks
Audio Moment Retrieval
2 papers | 0 benchmarks
Fake Voice Detection
2 papers | 1 benchmark
Music Quality Assessment
2 papers | 0 benchmarks
Timbre Interpolation
2 papers | 0 benchmarks
Anime
1 paper | 0 benchmarks
Audio Multiple Target Classification
1 paper | 0 benchmarks
Bird Classification
1 paper | 0 benchmarks
Bird Species Classification With Audio-Visual Data
1 paper | 0 benchmarks
Cadenza 1 - Task 1 - Headphone
1 paper | 1 benchmark
Cadenza 1 - Task 2 - In Car
1 paper | 1 benchmark
Directional Hearing
1 paper | 1 benchmark
Hearing Aid and Device Processing
1 paper | 0 benchmarks
Music Generation
1 paper | 1 benchmark
Parkinson Detection from Speech
1 paper | 0 benchmarks
Real-time Directional Hearing
1 paper | 1 benchmark
Self-Supervised Sound Classification
1 paper | 0 benchmarks
Semi-supervised Audio Classification
1 paper | 0 benchmarks
Semi-Supervised Audio Regression
1 paper | 0 benchmarks
Shooter Localization
1 paper | 0 benchmarks
Signal Processing
1 paper | 0 benchmarks
Single-Label Target Sound Extraction
1 paper | 0 benchmarks
Speaker Orientation
1 paper | 0 benchmarks
Streaming Target Sound Extraction
1 paper | 1 benchmark
Synthetic Song Detection
1 paper | 0 benchmarks
Unsupervised Few-Shot Audio Classification
1 paper | 0 benchmarks
Video/Text-to-Audio Generation
1 paper | 0 benchmarks