Audio Classification On Epic Kitchens 100
Metrics
Top-1 Action
Top-1 Noun
Top-1 Verb
Results
Performance results of various models on this benchmark
Comparison Table
Model Name | Top-1 Action | Top-1 Noun | Top-1 Verb |
---|---|---|---|
audiovisual-masked-autoencoders | 45.8 | 55.9 | 70.8 |
audiovisual-masked-autoencoders | 46.0 | 56.4 | 71.4 |
play-it-back-iterative-attention-for-audio | 15.9 | 23.1 | 47 |
audiovisual-masked-autoencoders | 19.7 | 27.2 | 52.7 |