» Articles » PMID: 35281195

Multimode Gesture Recognition Algorithm Based on Convolutional Long Short-Term Memory Network

Overview
Specialty Biology
Date 2022 Mar 14
PMID 35281195
Authors
Affiliations
Soon will be listed here.
Abstract

Gesture recognition utilizes deep learning network model to automatically extract deep features of data; however, traditional machine learning algorithms rely on manual feature extraction and poor model generalization ability. In this paper, a multimodal gesture recognition algorithm based on convolutional long-term memory network is proposed. First, a convolutional neural network (CNN) is employed to automatically extract the deeply hidden features of multimodal gesture data. Then, a time series model is constructed using a long short-term memory (LSTM) network to learn the long-term dependence of multimodal gesture features on the time series. On this basis, the classification of multimodal gestures is realized by the SoftMax classifier. Finally, the method is experimented and evaluated on two dynamic gesture datasets, VIVA and NVGesture. Experimental results indicate that the accuracy rates of the proposed method on the VIVA and NVGesture datasets are 92.55% and 87.38%, respectively, and its recognition accuracy and convergence performance are better than those of other comparison algorithms.

Citing Articles

LAVRF: Sign language recognition via Lightweight Attentive VGG16 with Random Forest.

Ewe E, Lee C, Lim K, Kwek L, Alqahtani A PLoS One. 2024; 19(4):e0298699.

PMID: 38574042 PMC: 10994320. DOI: 10.1371/journal.pone.0298699.

References
1.
Zhang R, Zhao L, Lou W, Abrigo J, Mok V, Chu W . Automatic Segmentation of Acute Ischemic Stroke From DWI Using 3-D Fully Convolutional DenseNets. IEEE Trans Med Imaging. 2018; 37(9):2149-2160. DOI: 10.1109/TMI.2018.2821244. View

2.
Qi Y, Li Q, Karimian H, Liu D . A hybrid model for spatiotemporal forecasting of PM based on graph convolutional neural network and long short-term memory. Sci Total Environ. 2019; 664:1-10. DOI: 10.1016/j.scitotenv.2019.01.333. View

3.
Oudah M, Al-Naji A, Chahl J . Hand Gesture Recognition Based on Computer Vision: A Review of Techniques. J Imaging. 2021; 6(8). PMC: 8321080. DOI: 10.3390/jimaging6080073. View

4.
Kawaguchi K, Bengio Y . Depth with nonlinearity creates no bad local minima in ResNets. Neural Netw. 2019; 118:167-174. DOI: 10.1016/j.neunet.2019.06.009. View

5.
Ruszczycki B, Pels K, Walczak A, Zamlynska K, Such M, Szczepankiewicz A . Three-Dimensional Segmentation and Reconstruction of Neuronal Nuclei in Confocal Microscopic Images. Front Neuroanat. 2019; 13:81. PMC: 6710455. DOI: 10.3389/fnana.2019.00081. View