HyperAI
HyperAI
Accueil
Actualités
Articles de recherche récents
Tutoriels
Ensembles de données
Wiki
SOTA
Modèles LLM
Classement GPU
Événements
Recherche
À propos
Français
HyperAI
HyperAI
Toggle sidebar
Rechercher sur le site...
⌘
K
Accueil
SOTA
Séparation vocale
Speech Separation On Whamr
Speech Separation On Whamr
Métriques
SI-SDRi
Résultats
Résultats de performance de divers modèles sur ce benchmark
Columns
Nom du modèle
SI-SDRi
Paper Title
Repository
SepReformer-L + DM
17.1
Separate and Reconstruct: Asymmetric Encoder-Decoder for Speech Separation
-
Bi-LSTM-TASNET
9.2
WHAM!: Extending Speech Separation to Noisy Environments
-
DPTNET - SRSSN
12.3
Stepwise-Refining Speech Separation Network via Fine-Grained Encoding in High-order Latent Domain
-
TD-Confomer (S)
10.5
On Time Domain Conformer Models for Monaural Speech Separation in Noisy Reverberant Acoustic Environments
-
MossFormer (L) + DM
16.3
MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-Head Transformer with Convolution-Augmented Joint Self-Attentions
-
TD-Conformer (L) + DM
13.4
On Time Domain Conformer Models for Monaural Speech Separation in Noisy Reverberant Acoustic Environments
-
VSUNOS
12.2
Voice Separation with an Unknown Number of Multiple Speakers
-
DPRNN - SRSSN
12.3
Stepwise-Refining Speech Separation Network via Fine-Grained Encoding in High-order Latent Domain
-
TF-Locoformer (M)
18.5
TF-Locoformer: Transformer with Local Modeling by Convolution for Speech Separation and Enhancement
-
Deformable TCN + Dynamic Mixing
11.1
Deformable Temporal Convolutional Networks for Monaural Noisy Reverberant Speech Separation
-
TD-Conformer (XL) + DM
14.6
On Time Domain Conformer Models for Monaural Speech Separation in Noisy Reverberant Acoustic Environments
-
Wavesplit
13.2
Wavesplit: End-to-End Speech Separation by Speaker Clustering
-
Improved Sudo rm -rf (U=36)
13.5
Compute and memory efficient universal sound source separation
-
Deformable TCN + Shared Weights + Dynamic Mixing
10.1
Deformable Temporal Convolutional Networks for Monaural Noisy Reverberant Speech Separation
-
MossFormer2
17.0
MossFormer2: Combining Transformer and RNN-Free Recurrent Network for Enhanced Time-Domain Monaural Speech Separation
-
Sudo rm -rf (U=16)
12.1
Sudo rm -rf: Efficient Networks for Universal Audio Source Separation
-
TF-Locoformer (S)
17.4
TF-Locoformer: Transformer with Local Modeling by Convolution for Speech Separation and Enhancement
-
TD-Confomer (M) + DM
12
On Time Domain Conformer Models for Monaural Speech Separation in Noisy Reverberant Acoustic Environments
-
0 of 18 row(s) selected.
Previous
Next
Speech Separation On Whamr | SOTA | HyperAI