Speaker Identification on VoxCeleb1
Metrics
Accuracy
Top-1 (%)
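As a rough illustration of the Top-1 (%) metric listed above, the sketch below computes the percentage of utterances whose highest-scoring speaker matches the reference label. The function name `top1_accuracy` and the toy scores are hypothetical and are not taken from any of the listed papers; each paper may use its own evaluation protocol and data split.

```python
from typing import Sequence


def top1_accuracy(scores: Sequence[Sequence[float]], labels: Sequence[int]) -> float:
    """Return top-1 accuracy in percent.

    scores[i][k] is the model's score for speaker k on utterance i (hypothetical layout).
    labels[i] is the index of the true speaker for utterance i.
    """
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if not labels:
        raise ValueError("at least one utterance is required")

    correct = 0
    for row, label in zip(scores, labels):
        # The prediction is the index of the highest score in the row.
        predicted = max(range(len(row)), key=row.__getitem__)
        if predicted == label:
            correct += 1
    return 100.0 * correct / len(labels)


# Toy example with 4 utterances and 2 candidate speakers.
toy_scores = [[0.1, 0.9], [0.8, 0.2], [0.3, 0.7], [0.6, 0.4]]
toy_labels = [1, 0, 0, 0]
result = top1_accuracy(toy_scores, toy_labels)
```

In this toy case three of the four predictions match their labels, so `result` is 75.0.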
Results
Performance results of various models on this benchmark
| Model | Accuracy | Top-1 (%) | Paper Title |
|---|---|---|---|
| SSAST-PATCH | 64.2 | 64.2 | SSAST: Self-Supervised Audio Spectrogram Transformer |
| AutoSpeech (N=8,C=128) | 87.66 | 87.66 | AutoSpeech: Neural Architecture Search for Speaker Recognition |
| SSAST-FRAME | 80.8 | 80.8 | SSAST: Self-Supervised Audio Spectrogram Transformer |
| ATST Base (ours) | 94.3 | 94.3 | ATST: Audio Representation Learning with Teacher-Student Transformer |
| M2D ratio=0.6 | 94.8 | 94.8 | Masked Modeling Duo: Learning Representations by Encouraging Both Networks to Model the Input |
| M2D/0.6 | 96.5 | 96.5 | Masked Modeling Duo: Towards a Universal Audio Pre-training Framework |
| AudioMAE (local) | 94.8 | 94.8 | Masked Autoencoders that Listen |
| MSM-MAE | 96.6 | 96.6 | Masked Modeling Duo: Towards a Universal Audio Pre-training Framework |
| AudioMAE (global) | 94.1 | 94.1 | Masked Autoencoders that Listen |
| M2D/0.7 | 96.3 | 96.3 | Masked Modeling Duo: Towards a Universal Audio Pre-training Framework |
| COLA | 37.7 | 37.7 | Contrastive Learning of General-Purpose Audio Representations |
| SSAMBA | 70.1 | 70.1 | SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model |