JSE: Joint Semantic Encoder for Zero-shot Gesture Learning

Overview
Date 2024 Nov 26
PMID 39588314
Abstract

Zero-shot learning (ZSL) is a transfer learning paradigm that aims to recognize unseen categories from only a high-level description of them. While deep learning has greatly pushed the limits of ZSL for object classification, ZSL for gesture recognition (ZSGL) remains largely unexplored. Previous attempts to address ZSGL focused on the creation of gesture attributes and on algorithmic improvements, and there is little or no research on feature selection for ZSGL. It is indisputable that deep learning has obviated the need for feature engineering for problems with large datasets. However, when data are scarce, it is critical to leverage domain information to create discriminative input features. The main goal of this work is to study the effect of three different feature extraction techniques (, and features) on the performance of ZSGL. In addition, we propose a bilinear auto-encoder approach for ZSGL, referred to as Joint Semantic Encoder (JSE), that jointly minimizes the reconstruction, semantic and classification losses. We conducted extensive experiments to compare the feature extraction techniques and to evaluate the performance of JSE against existing ZSL methods. For classification scenario, irrespective of the feature type, results showed that JSE outperforms other approaches by 5% (p < 0.01). When JSE is trained with features in condition, we showed that JSE significantly outperforms other methods by 5% (p < 0.01).
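The abstract names three loss terms but does not give their form, so the following is only an illustrative sketch of how a joint objective of this kind might be written, not the authors' formulation. Everything in it is an assumption: a single linear encoder `W` with tied decoder weights, squared-error reconstruction and semantic terms, a softmax cross-entropy classification term over class attribute vectors `A`, and the hypothetical weights `lam_sem` and `lam_cls`.

```python
import numpy as np


def softmax(scores):
    """Row-wise softmax; subtracting the row maximum keeps exp() stable."""
    shifted = scores - scores.max(axis=1, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=1, keepdims=True)


def joint_loss(W, X, S, A, y, lam_sem=1.0, lam_cls=1.0):
    """Illustrative three-term objective (assumed form, not from the paper).

    W: (d, k) encoder weights; the decoder is assumed to be W.T (tied weights).
    X: (n, d) input features.
    S: (n, k) semantic (attribute) target for each sample.
    A: (C, k) attribute vector for each seen class.
    y: (n,) integer class labels in [0, C).
    """
    n = X.shape[0]
    Z = X @ W          # encode features into the semantic space
    X_hat = Z @ W.T    # decode back to the feature space

    reconstruction = np.sum((X - X_hat) ** 2) / n
    semantic = np.sum((Z - S) ** 2) / n

    # Class scores are inner products between codes and class attributes.
    probs = softmax(Z @ A.T)
    classification = -np.mean(np.log(probs[np.arange(n), y] + 1e-12))

    total = reconstruction + lam_sem * semantic + lam_cls * classification
    parts = {
        "reconstruction": reconstruction,
        "semantic": semantic,
        "classification": classification,
    }
    return total, parts


def predict_unseen(W, X, A_unseen):
    """Zero-shot prediction: pick the unseen class whose attribute
    vector has the highest inner product with the encoded sample."""
    return np.argmax((X @ W) @ A_unseen.T, axis=1)


# Small demo on random data (shapes only; the values carry no meaning).
rng = np.random.default_rng(0)
X_demo = rng.normal(size=(8, 5))
A_demo = rng.normal(size=(3, 4))
y_demo = rng.integers(0, 3, size=8)
W_demo = rng.normal(size=(5, 4)) * 0.1
total_demo, parts_demo = joint_loss(W_demo, X_demo, A_demo[y_demo], A_demo, y_demo)
```

In this sketch the semantic target of each sample is simply its class attribute vector (`A[y]`), and at test time only the attribute vectors of the unseen classes are needed, which is what makes the setup zero-shot.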
