3D Convolutional Neural Networks for Human Action Recognition

Overview
Date 2012 Mar 7
PMID 22392705
Citations 318
Abstract

We consider the automated recognition of human actions in surveillance videos. Most current methods build classifiers based on complex handcrafted features computed from the raw inputs. Convolutional neural networks (CNNs) are a type of deep model that can act directly on the raw inputs. However, such models are currently limited to handling 2D inputs. In this paper, we develop a novel 3D CNN model for action recognition. This model extracts features from both the spatial and the temporal dimensions by performing 3D convolutions, thereby capturing the motion information encoded in multiple adjacent frames. The developed model generates multiple channels of information from the input frames, and the final feature representation combines information from all channels. To further boost the performance, we propose regularizing the outputs with high-level features and combining the predictions of a variety of different models. We apply the developed models to recognize human actions in the real-world environment of airport surveillance videos, and they achieve superior performance in comparison to baseline methods.
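As a rough illustration of why a 3D convolution can capture motion across adjacent frames, the sketch below slides a small kernel over both the spatial and the temporal axes of a toy clip. It is a minimal NumPy example under stated assumptions, not the model from the paper: the function names (`conv3d_valid`, `moving_dot_clip`), the clip size, and the hand-set temporal-difference kernel are all hypothetical choices made here for illustration, whereas a real 3D CNN learns its kernels from data.

```python
import numpy as np


def conv3d_valid(clip, kernel):
    """Naive 'valid' 3D cross-correlation (the operation CNN layers compute).

    clip:   array of shape (T, H, W), a stack of grayscale frames
    kernel: array of shape (kt, kh, kw)
    returns an array of shape (T - kt + 1, H - kh + 1, W - kw + 1)
    """
    T, H, W = clip.shape
    kt, kh, kw = kernel.shape
    out = np.zeros((T - kt + 1, H - kh + 1, W - kw + 1))
    for t in range(out.shape[0]):
        for y in range(out.shape[1]):
            for x in range(out.shape[2]):
                # Each output value mixes a spatial patch across kt adjacent frames.
                window = clip[t:t + kt, y:y + kh, x:x + kw]
                out[t, y, x] = np.sum(window * kernel)
    return out


def moving_dot_clip(num_frames=7, size=8):
    """Toy clip (hypothetical data): one bright pixel moving right by one column per frame."""
    clip = np.zeros((num_frames, size, size))
    for t in range(num_frames):
        clip[t, size // 2, t] = 1.0
    return clip


# Hand-set kernel spanning 2 frames and a 3x3 patch: (next frame) minus (current frame).
# This is an assumption for illustration; a trained network would learn such weights.
kernel = np.zeros((2, 3, 3))
kernel[0] = -1.0 / 9
kernel[1] = 1.0 / 9

moving = moving_dot_clip()
static = np.repeat(moving[:1], 7, axis=0)  # the first frame repeated, so nothing moves

moving_response = conv3d_valid(moving, kernel)
static_response = conv3d_valid(static, kernel)

print("output shape:", moving_response.shape)
print("max |response|, moving clip:", np.abs(moving_response).max())
print("max |response|, static clip:", np.abs(static_response).max())
```

Because the kernel spans two frames, the static clip produces a zero response everywhere while the moving clip does not. A 2D convolution applied to each frame separately could not make that distinction, since every frame of the two clips contains the same kind of single bright pixel.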

Citing Articles

A bioinspired in-materia analog photoelectronic reservoir computing for human action processing.

Cui H, Xiao Y, Yang Y, Pei M, Ke S, Fang X. Nat Commun. 2025; 16(1):2263.

PMID: 40050621 PMC: 11885466. DOI: 10.1038/s41467-025-56899-3.


Improvement in positional accuracy of neural-network predicted hydration sites of proteins by incorporating atomic details of water-protein interactions and site-searching algorithm.

Sato K, Nakasako M. Biophys Physicobiol. 2025; 22(1):e220004.

PMID: 40046557 PMC: 11876803. DOI: 10.2142/biophysico.bppb-v22.0004.


Deep learning-based debris flow hazard detection and recognition system: a case study.

Wu F, Zhang J, Liu D, Maier A, Christlein V. Sci Rep. 2025; 15(1):6789.

PMID: 40000712 PMC: 11862229. DOI: 10.1038/s41598-025-86471-4.


A deep learning-based system for automatic detection of emesis with high accuracy in Suncus murinus.

Lu Z, Qiao Y, Huang X, Cui D, Liu J, Ngan M. Commun Biol. 2025; 8(1):209.

PMID: 39930110 PMC: 11811283. DOI: 10.1038/s42003-025-07479-0.


Digital twin brain simulator for real-time consciousness monitoring and virtual intervention using primate electrocorticogram data.

Takahashi Y, Idei H, Komatsu M, Tani J, Tomita H, Yamashita Y. NPJ Digit Med. 2025; 8(1):80.

PMID: 39929926 PMC: 11811282. DOI: 10.1038/s41746-025-01444-1.