Date

7 years ago

Acoustic Model

An acoustic model is used to calculate the probability of the model generating a given speech waveform. It is one of the most important components of a speech recognition system and accounts for most of the system's computing overhead, so it largely determines the system's performance.
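For orientation, the role of this probability can be written in the usual statistical formulation of speech recognition. The symbols are assumptions introduced here, not notation from the original entry: X is the sequence of acoustic feature vectors extracted from the waveform and W is a candidate word sequence.

```latex
\hat{W} \;=\; \arg\max_{W} P(W \mid X) \;=\; \arg\max_{W} \; p(X \mid W)\, P(W)
```

In this reading, p(X | W) is the term supplied by the acoustic model and P(W) is supplied by a separate language model. The second equality follows from Bayes' rule, because p(X) does not depend on W.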

Development History

Traditional methods: Based on hidden Markov models (HMMs), such as the GMM-HMM approach, in which a Gaussian mixture model (GMM) models the distribution of speech acoustic features and an HMM models the temporal structure of the speech signal.
Deep neural networks: In 2009, Hinton and his students applied a feedforward, fully connected deep neural network to speech acoustic modeling. On the TIMIT dataset, the resulting DNN-HMM acoustic model outperformed the GMM-HMM acoustic model.
Variable-length context information: In 2015, acoustic models that exploit variable-length context information were put into use. The optimal amount of context depends on the phoneme and the speaking rate, so the fixed-length context windows used in DNN-HMM hybrid systems are not the best choice. Recent models are mainly based on recurrent neural networks (RNNs) and convolutional neural networks (CNNs).
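The division of labor in the GMM-HMM approach listed above can be illustrated with a minimal sketch in plain Python. It assumes diagonal-covariance Gaussians and a toy two-state, left-to-right model over one-dimensional features. All function names, parameters, and numbers are hypothetical and chosen for illustration only; they do not come from any particular toolkit. The GMM scores each frame, and the HMM forward algorithm combines those scores over time.

```python
import math

NEG_INF = -math.inf

def log_gaussian(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at feature vector x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )

def logsumexp(values):
    """Numerically stable log(sum(exp(v))) over a list of log values."""
    top = max(values)
    if top == NEG_INF:
        return NEG_INF
    return top + math.log(sum(math.exp(v - top) for v in values))

def log_gmm(x, weights, means, variances):
    """Log density of a Gaussian mixture: the emission model of one HMM state."""
    return logsumexp([
        math.log(w) + log_gaussian(x, m, v)
        for w, m, v in zip(weights, means, variances)
    ])

def forward_log_likelihood(frames, log_init, log_trans, states):
    """HMM forward algorithm: log p(frames | model), summed over all state paths."""
    n = len(states)
    # Initialization: start probability times emission of the first frame.
    alpha = [log_init[s] + log_gmm(frames[0], *states[s]) for s in range(n)]
    # Recursion: propagate through the transitions, then emit the next frame.
    for x in frames[1:]:
        alpha = [
            logsumexp([alpha[r] + log_trans[r][s] for r in range(n)])
            + log_gmm(x, *states[s])
            for s in range(n)
        ]
    return logsumexp(alpha)

# Toy model (hypothetical values): each state is (weights, means, variances).
states = [
    ([0.5, 0.5], [[0.0], [1.0]], [[1.0], [1.0]]),  # state 0: two-component GMM
    ([1.0], [[5.0]], [[1.0]]),                     # state 1: single Gaussian
]
log_init = [0.0, NEG_INF]                          # always start in state 0
log_trans = [
    [math.log(0.5), math.log(0.5)],                # state 0: stay or advance
    [NEG_INF, 0.0],                                # state 1: absorbing
]

matched = forward_log_likelihood(
    [[0.2], [0.8], [5.1], [4.9]], log_init, log_trans, states)
mismatched = forward_log_likelihood(
    [[5.1], [4.9], [0.2], [0.8]], log_init, log_trans, states)
print(matched, mismatched)
```

A frame sequence that moves from values near the first state's means to values near the second state's mean should score higher than the same frames in reverse order, because the left-to-right transitions cannot return to the first state. Working in the log domain avoids numerical underflow on long utterances.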

References

【1】Acoustic Model of Speech Recognition Technology – 52AI Artificial Intelligence – CSDN Blog

【2】Yu Dong, Deputy Director of Tencent AI Lab: Progress in acoustic models based on deep learning in the past two years | Machine Heart
