» Articles » PMID: 39753661

Scene Categorization by Hessian-regularized Active Perceptual Feature Selection

Overview
Journal Sci Rep
Specialty Science
Date 2025 Jan 3
PMID 39753661
Authors
Affiliations
Soon will be listed here.
Abstract

Decoding the semantic categories of complex sceneries is fundamental to numerous artificial intelligence (AI) infrastructures. This work presents an advanced selection of multi-channel perceptual visual features for recognizing scenic images with elaborate spatial structures, focusing on developing a deep hierarchical model dedicated to learning human gaze behavior. Utilizing the BING objectness measure, we efficiently localize objects or their details across varying scales within scenes. To emulate humans observing semantically or visually significant areas within scenes, we propose a robust deep active learning (RDAL) strategy. This strategy progressively generates gaze shifting paths (GSP) and calculates deep GSP representations within a unified architecture. A notable advantage of RDAL is the robustness to label noise, which is implemented by a carefully-designed sparse penalty term. This mechanism ensures that irrelevant or misleading deep GSP features are intelligently discarded. Afterward, a novel Hessian-regularized Feature Selector (HFS) is proposed to select high-quality features from the deep GSP features, wherein (i) the spatial composition of scenic patches can be optimally maintained, and (ii) a linear SVM is learned simultaneously. Empirical evaluations across six standard scenic datasets demonstrated our method's superior performance, highlighting its exceptional ability to differentiate various sophisticated scenery categories.

References
1.
Wang W, Shen J, Dong X, Borji A, Yang R . Inferring Salient Objects from Human Fixations. IEEE Trans Pattern Anal Mach Intell. 2019; 42(8):1913-1927. DOI: 10.1109/TPAMI.2019.2905607. View

2.
Yuan Y, Mou L, Lu X . Scene recognition by manifold regularized deep learning architecture. IEEE Trans Neural Netw Learn Syst. 2015; 26(10):2222-33. DOI: 10.1109/TNNLS.2014.2359471. View

3.
Wang W, Sun G, Gool L . Looking Beyond Single Images for Weakly Supervised Semantic Segmentation Learning. IEEE Trans Pattern Anal Mach Intell. 2022; 46(3):1635-1649. DOI: 10.1109/TPAMI.2022.3168530. View

4.
Lu X, Li X, Mou L . Semi-Supervised Multitask Learning for Scene Recognition. IEEE Trans Cybern. 2014; 45(9):1967-76. DOI: 10.1109/TCYB.2014.2362959. View

5.
Pont-Tuset J, Arbelaez P, Barron J, Marques F, Malik J . Multiscale Combinatorial Grouping for Image Segmentation and Object Proposal Generation. IEEE Trans Pattern Anal Mach Intell. 2016; 39(1):128-140. DOI: 10.1109/TPAMI.2016.2537320. View