HyperAI
الرئيسية
الأخبار
أحدث الأوراق البحثية
الدروس
مجموعات البيانات
الموسوعة
SOTA
نماذج LLM
لوحة الأداء GPU
الفعاليات
البحث
حول
العربية
HyperAI
Toggle sidebar
البحث في الموقع...
⌘
K
الرئيسية
SOTA
Speaker Identification
Speaker Identification On Voxceleb1
Speaker Identification On Voxceleb1
المقاييس
Accuracy
Top-1 (%)
النتائج
نتائج أداء النماذج المختلفة على هذا المعيار القياسي
Columns
اسم النموذج
Accuracy
Top-1 (%)
Paper Title
Repository
SSAST-PATCH
64.2
64.2
SSAST: Self-Supervised Audio Spectrogram Transformer
AutoSpeech (N=8,C=128)
87.66
87.66
AutoSpeech: Neural Architecture Search for Speaker Recognition
SSAST-FRAME
80.8
80.8
SSAST: Self-Supervised Audio Spectrogram Transformer
ATST Base (ours)
94.3
94.3
ATST: Audio Representation Learning with Teacher-Student Transformer
M2D ratio=0.6
94.8
94.8
Masked Modeling Duo: Learning Representations by Encouraging Both Networks to Model the Input
M2D/0.6
96.5
96.5
Masked Modeling Duo: Towards a Universal Audio Pre-training Framework
AudioMAE (local)
94.8
94.8
Masked Autoencoders that Listen
MSM-MAE
96.6
96.6
Masked Modeling Duo: Towards a Universal Audio Pre-training Framework
AudioMAE (global)
94.1
94.1
Masked Autoencoders that Listen
M2D/0.7
96.3
96.3
Masked Modeling Duo: Towards a Universal Audio Pre-training Framework
COLA
37.7
37.7
Contrastive Learning of General-Purpose Audio Representations
SSAMBA
70.1
70.1
SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model
0 of 12 row(s) selected.
Previous
Next