Random Deep Belief Networks for Recognizing Emotions from Speech Signals

Overview

Journal Comput Intell Neurosci

Specialty Biology

Date 2017 Mar 31

PMID 28356908

Citations 10

Authors

Guihua Wen

Huihui Li

Jubing Huang

Danyang Li

Eryang Xun


Abstract

Human emotions can now be recognized from speech signals using machine learning methods; however, these methods achieve lower recognition accuracy in real applications because their representations are not rich enough. Deep belief networks (DBNs) can automatically discover multiple levels of representation in speech signals. To make full use of this advantage, this paper presents an ensemble method, random deep belief networks (RDBN), for speech emotion recognition. The method first extracts low-level features from the input speech signal and uses them to construct many random subspaces. Each random subspace is then fed to a DBN, which yields higher-level features that serve as the input to a classifier that outputs an emotion label. All output emotion labels are then fused by majority voting to decide the final emotion label for the input speech signal. Experimental results on benchmark speech emotion databases show that RDBN achieves better accuracy than the compared methods for speech emotion recognition.
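The ensemble structure described in the abstract (random feature subspaces, one learner per subspace, majority voting over the predicted labels) can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract gives no architecture or hyperparameters, so the DBN plus classifier stage is replaced here by a simple nearest-centroid learner as a stand-in, and every name (`make_subspaces`, `NearestCentroid`, `RandomSubspaceEnsemble`), the subspace count, the subspace size, and the toy data are assumptions made for illustration only.

```python
import random
from collections import Counter


def make_subspaces(n_features, n_subspaces, subspace_size, seed=0):
    """Draw random subsets of feature indices, one subset per ensemble member."""
    rng = random.Random(seed)
    return [sorted(rng.sample(range(n_features), subspace_size))
            for _ in range(n_subspaces)]


def project(x, indices):
    """Keep only the features of x that belong to one random subspace."""
    return [x[i] for i in indices]


def majority_vote(labels):
    """Return the most frequent label among the ensemble members' outputs."""
    return Counter(labels).most_common(1)[0][0]


class NearestCentroid:
    """Stand-in base learner (assumption): the paper uses a DBN plus a classifier here."""

    def fit(self, X, y):
        sums, counts = {}, Counter(y)
        for row, label in zip(X, y):
            acc = sums.setdefault(label, [0.0] * len(row))
            for j, value in enumerate(row):
                acc[j] += value
        # One mean vector per emotion label.
        self.centroids = {label: [s / counts[label] for s in acc]
                          for label, acc in sums.items()}
        return self

    def predict_one(self, x):
        def sq_dist(c):
            return sum((a - b) ** 2 for a, b in zip(x, c))
        return min(self.centroids, key=lambda label: sq_dist(self.centroids[label]))


class RandomSubspaceEnsemble:
    """Train one base learner per random subspace and fuse outputs by voting."""

    def __init__(self, n_subspaces=15, subspace_size=3, seed=0):
        self.n_subspaces = n_subspaces
        self.subspace_size = subspace_size
        self.seed = seed

    def fit(self, X, y):
        self.subspaces = make_subspaces(len(X[0]), self.n_subspaces,
                                        self.subspace_size, self.seed)
        self.models = [NearestCentroid().fit([project(x, idx) for x in X], y)
                       for idx in self.subspaces]
        return self

    def predict_one(self, x):
        votes = [model.predict_one(project(x, idx))
                 for model, idx in zip(self.models, self.subspaces)]
        return majority_vote(votes)


# Toy low-level feature vectors (made-up numbers, not real speech features).
X_train = [[0.0, 0.1, 0.0, 0.1, 0.0, 0.1],
           [0.1, 0.0, 0.1, 0.0, 0.1, 0.0],
           [1.0, 0.9, 1.0, 0.9, 1.0, 0.9],
           [0.9, 1.0, 0.9, 1.0, 0.9, 1.0]]
y_train = ["neutral", "neutral", "angry", "angry"]

ensemble = RandomSubspaceEnsemble().fit(X_train, y_train)
print(ensemble.predict_one([0.95] * 6))
```

Swapping `NearestCentroid` for a real DBN-based learner would leave the subspace sampling and voting code unchanged, which is the point of the sketch: the ensemble logic is independent of the base model.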

Citing Articles

An enhanced speech emotion recognition using vision transformer.

Akinpelu S, Viriri S, Adegun A Sci Rep. 2024; 14(1):13126.

PMID: 38849422 PMC: 11161461. DOI: 10.1038/s41598-024-63776-4.

Deep learning-based EEG emotion recognition: Current trends and future perspectives.

Wang X, Ren Y, Luo Z, He W, Hong J, Huang Y Front Psychol. 2023; 14:1126994.

PMID: 36923142 PMC: 10009917. DOI: 10.3389/fpsyg.2023.1126994.

Multi-Stream Convolution-Recurrent Neural Networks Based on Attention Mechanism Fusion for Speech Emotion Recognition.

Tao H, Geng L, Shan S, Mai J, Fu H Entropy (Basel). 2022; 24(8).

PMID: 35893005 PMC: 9331177. DOI: 10.3390/e24081025.

Bidirectional parallel echo state network for speech emotion recognition.

Ibrahim H, Loo C, Alnajjar F Neural Comput Appl. 2022; 34(20):17581-17599.

PMID: 35669535 PMC: 9152839. DOI: 10.1007/s00521-022-07410-2.

The Emotion Probe: On the Universality of Cross-Linguistic and Cross-Gender Speech Emotion Recognition via Machine Learning.

Costantini G, Parada-Cabaleiro E, Casali D, Cesarini V Sensors (Basel). 2022; 22(7).

PMID: 35408076 PMC: 9003467. DOI: 10.3390/s22072461.
