HyperAI超神经
首页
资讯
最新论文
教程
数据集
百科
SOTA
LLM 模型天梯
GPU 天梯
顶会
开源项目
全站搜索
关于
中文
HyperAI超神经
Toggle sidebar
全站搜索…
⌘
K
首页
SOTA
Speaker Identification
Speaker Identification On Voxceleb1
Speaker Identification On Voxceleb1
评估指标
Accuracy
Top-1 (%)
评测结果
各个模型在此基准测试上的表现结果
Columns
模型名称
Accuracy
Top-1 (%)
Paper Title
Repository
SSAST-PATCH
64.2
64.2
SSAST: Self-Supervised Audio Spectrogram Transformer
AutoSpeech (N=8,C=128)
87.66
87.66
AutoSpeech: Neural Architecture Search for Speaker Recognition
SSAST-FRAME
80.8
80.8
SSAST: Self-Supervised Audio Spectrogram Transformer
ATST Base (ours)
94.3
94.3
ATST: Audio Representation Learning with Teacher-Student Transformer
M2D ratio=0.6
94.8
94.8
Masked Modeling Duo: Learning Representations by Encouraging Both Networks to Model the Input
M2D/0.6
96.5
96.5
Masked Modeling Duo: Towards a Universal Audio Pre-training Framework
AudioMAE (local)
94.8
94.8
Masked Autoencoders that Listen
MSM-MAE
96.6
96.6
Masked Modeling Duo: Towards a Universal Audio Pre-training Framework
AudioMAE (global)
94.1
94.1
Masked Autoencoders that Listen
M2D/0.7
96.3
96.3
Masked Modeling Duo: Towards a Universal Audio Pre-training Framework
COLA
37.7
37.7
Contrastive Learning of General-Purpose Audio Representations
SSAMBA
70.1
70.1
SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model
0 of 12 row(s) selected.
Previous
Next