» Articles » PMID: 28356908

Random Deep Belief Networks for Recognizing Emotions from Speech Signals

Overview
Specialty Biology
Date 2017 Mar 31
PMID 28356908
Citations 10
Authors
Affiliations
Soon will be listed here.
Abstract

Now the human emotions can be recognized from speech signals using machine learning methods; however, they are challenged by the lower recognition accuracies in real applications due to lack of the rich representation ability. Deep belief networks (DBN) can automatically discover the multiple levels of representations in speech signals. To make full of its advantages, this paper presents an ensemble of random deep belief networks (RDBN) method for speech emotion recognition. It firstly extracts the low level features of the input speech signal and then applies them to construct lots of random subspaces. Each random subspace is then provided for DBN to yield the higher level features as the input of the classifier to output an emotion label. All outputted emotion labels are then fused through the majority voting to decide the final emotion label for the input speech signal. The conducted experimental results on benchmark speech emotion databases show that RDBN has better accuracy than the compared methods for speech emotion recognition.

Citing Articles

An enhanced speech emotion recognition using vision transformer.

Akinpelu S, Viriri S, Adegun A Sci Rep. 2024; 14(1):13126.

PMID: 38849422 PMC: 11161461. DOI: 10.1038/s41598-024-63776-4.


Deep learning-based EEG emotion recognition: Current trends and future perspectives.

Wang X, Ren Y, Luo Z, He W, Hong J, Huang Y Front Psychol. 2023; 14:1126994.

PMID: 36923142 PMC: 10009917. DOI: 10.3389/fpsyg.2023.1126994.


Multi-Stream Convolution-Recurrent Neural Networks Based on Attention Mechanism Fusion for Speech Emotion Recognition.

Tao H, Geng L, Shan S, Mai J, Fu H Entropy (Basel). 2022; 24(8).

PMID: 35893005 PMC: 9331177. DOI: 10.3390/e24081025.


Bidirectional parallel echo state network for speech emotion recognition.

Ibrahim H, Loo C, Alnajjar F Neural Comput Appl. 2022; 34(20):17581-17599.

PMID: 35669535 PMC: 9152839. DOI: 10.1007/s00521-022-07410-2.


The Emotion Probe: On the Universality of Cross-Linguistic and Cross-Gender Speech Emotion Recognition via Machine Learning.

Costantini G, Parada-Cabaleiro E, Casali D, Cesarini V Sensors (Basel). 2022; 22(7).

PMID: 35408076 PMC: 9003467. DOI: 10.3390/s22072461.


References
1.
France D, Shiavi R, Silverman S, Silverman M, Wilkes D . Acoustical properties of speech as indicators of depression and suicidal risk. IEEE Trans Biomed Eng. 2000; 47(7):829-37. DOI: 10.1109/10.846676. View

2.
Kim S, Yu Z, Kil R, Lee M . Deep learning of support vector machines with class probability output networks. Neural Netw. 2014; 64:19-28. DOI: 10.1016/j.neunet.2014.09.007. View

3.
Bengio Y, Lee H . Editorial introduction to the Neural Networks special issue on Deep Learning of Representations. Neural Netw. 2015; 64:1-3. DOI: 10.1016/j.neunet.2014.12.006. View