Watch-n-Patch: Unsupervised Learning of Actions and Relations
Overview
Affiliations
There is a large variation in the activities that humans perform in their everyday lives. We consider modeling these composite human activities which comprises multiple basic level actions in a completely unsupervised setting. Our model learns high-level co-occurrence and temporal relations between the actions. We consider the video as a sequence of short-term action clips, which contains human-words and object-words. An activity is about a set of action-topics and object-topics indicating which actions are present and which objects are interacting with. We then propose a new probabilistic model relating the words and the topics. It allows us to model long-range action relations that commonly exist in the composite activities, which is challenging in previous works. We apply our model to the unsupervised action segmentation and clustering, and to a novel application that detects forgotten actions, which we call action patching. For evaluation, we contribute a new challenging RGB-D activity video dataset recorded by the new Kinect v2, which contains several human daily activities as compositions of multiple actions interacting with different objects. Moreover, we develop a robotic system that watches and reminds people using our action patching algorithm. Our robotic setup can be easily deployed on any assistive robots.
Jovanovic M, Mitrov G, Zdravevski E, Lameski P, Colantonio S, Kampel M J Med Internet Res. 2022; 24(11):e36553.
PMID: 36331530 PMC: 9675018. DOI: 10.2196/36553.
Localized Trajectories for 2D and 3D Action Recognition.
Papadopoulos K, Demisse G, Ghorbel E, Antunes M, Aouada D, Ottersten B Sensors (Basel). 2019; 19(16).
PMID: 31405153 PMC: 6720755. DOI: 10.3390/s19163503.
Memory and mental time travel in humans and social robots.
Prescott T, Camilleri D, Martinez-Hernandez U, Damianou A, Lawrence N Philos Trans R Soc Lond B Biol Sci. 2019; 374(1771):20180025.
PMID: 30852998 PMC: 6452248. DOI: 10.1098/rstb.2018.0025.