HyperAI超神経
Speaker Identification on VoxCeleb1
Evaluation Metrics
Accuracy
Top-1 (%)
Evaluation Results
Performance results of each model on this benchmark
| Model | Accuracy | Top-1 (%) | Paper Title |
|---|---|---|---|
| SSAST-PATCH | 64.2 | 64.2 | SSAST: Self-Supervised Audio Spectrogram Transformer |
| AutoSpeech (N=8,C=128) | 87.66 | 87.66 | AutoSpeech: Neural Architecture Search for Speaker Recognition |
| SSAST-FRAME | 80.8 | 80.8 | SSAST: Self-Supervised Audio Spectrogram Transformer |
| ATST Base (ours) | 94.3 | 94.3 | ATST: Audio Representation Learning with Teacher-Student Transformer |
| M2D ratio=0.6 | 94.8 | 94.8 | Masked Modeling Duo: Learning Representations by Encouraging Both Networks to Model the Input |
| M2D/0.6 | 96.5 | 96.5 | Masked Modeling Duo: Towards a Universal Audio Pre-training Framework |
| AudioMAE (local) | 94.8 | 94.8 | Masked Autoencoders that Listen |
| MSM-MAE | 96.6 | 96.6 | Masked Modeling Duo: Towards a Universal Audio Pre-training Framework |
| AudioMAE (global) | 94.1 | 94.1 | Masked Autoencoders that Listen |
| M2D/0.7 | 96.3 | 96.3 | Masked Modeling Duo: Towards a Universal Audio Pre-training Framework |
| COLA | 37.7 | 37.7 | Contrastive Learning of General-Purpose Audio Representations |
| SSAMBA | 70.1 | 70.1 | SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model |