» Articles » PMID: 16566508

One-shot Learning of Object Categories

Overview
Date 2006 Mar 29
PMID 16566508
Citations 101
Authors
Affiliations
Soon will be listed here.
Abstract

Learning visual models of object categories notoriously requires hundreds or thousands of training examples. We show that it is possible to learn much information about a category from just one, or a handful, of images. The key insight is that, rather than learning from scratch, one can take advantage of knowledge coming from previously learned categories, no matter how different these categories might be. We explore a Bayesian implementation of this idea. Object categories are represented by probabilistic models. Prior knowledge is represented as a probability density function on the parameters of these models. The posterior model for an object category is obtained by updating the prior in the light of one or more observations. We test a simple implementation of our algorithm on a database of 101 diverse object categories. We compare category models learned by an implementation of our Bayesian approach to models learned from by Maximum Likelihood (ML) and Maximum A Posteriori (MAP) methods. We find that on a database of more than 100 categories, the Bayesian approach produces informative models when the number of training examples is too small for other methods to operate successfully.

Citing Articles

JSE: Joint Semantic Encoder for zero-shot gesture learning.

Madapana N, Wachs J Pattern Anal Appl. 2024; 25(3):679-692.

PMID: 39588314 PMC: 11588148. DOI: 10.1007/s10044-021-00992-y.


Early screening of miliary tuberculosis with tuberculous meningitis based on few-shot learning with multiple windows and feature granularities.

Tian Y, Liang Y, Chen Y, Li L, Bian H Sci Rep. 2024; 14(1):23620.

PMID: 39384848 PMC: 11464817. DOI: 10.1038/s41598-024-75253-z.


Few-Shot Learning in Wi-Fi-Based Indoor Positioning.

Xie F, Lam S, Xie M, Wang C Biomimetics (Basel). 2024; 9(9).

PMID: 39329573 PMC: 11430087. DOI: 10.3390/biomimetics9090551.


A Meta-Learning Approach for Classifying Multimodal Retinal Images of Retinal Vein Occlusion With Limited Data.

Jiachu D, Luo L, Xie M, Xie X, Guo J, Ye H Transl Vis Sci Technol. 2024; 13(9):22.

PMID: 39297809 PMC: 11421671. DOI: 10.1167/tvst.13.9.22.


Elements of episodic memory: insights from artificial agents.

Boyle A, Blomkvist A Philos Trans R Soc Lond B Biol Sci. 2024; 379(1913):20230416.

PMID: 39278254 PMC: 11449156. DOI: 10.1098/rstb.2023.0416.