» Articles » PMID: 20080025

Of Bits and Wows: A Bayesian Theory of Surprise with Applications to Attention

Overview
Journal Neural Netw
Specialties Biology
Neurology
Date 2010 Jan 19
PMID 20080025
Citations 62
Authors
Affiliations
Soon will be listed here.
Abstract

The amount of information contained in a piece of data can be measured by the effect this data has on its observer. Fundamentally, this effect is to transform the observer's prior beliefs into posterior beliefs, according to Bayes theorem. Thus the amount of information can be measured in a natural way by the distance (relative entropy) between the prior and posterior distributions of the observer over the available space of hypotheses. This facet of information, termed "surprise", is important in dynamic situations where beliefs change, in particular during learning and adaptation. Surprise can often be computed analytically, for instance in the case of distributions from the exponential family, or it can be numerically approximated. During sequential Bayesian learning, surprise decreases as the inverse of the number of training examples. Theoretical properties of surprise are discussed, in particular how it differs and complements Shannon's definition of information. A computer vision neural network architecture is then presented capable of computing surprise over images and video stimuli. Hypothesizing that surprising data ought to attract natural or artificial attention systems, the output of this architecture is used in a psychophysical experiment to analyze human eye movements in the presence of natural video stimuli. Surprise is found to yield robust performance at predicting human gaze (ROC-like ordinal dominance score approximately 0.7 compared to approximately 0.8 for human inter-observer repeatability, approximately 0.6 for simpler intensity contrast-based predictor, and 0.5 for chance). The resulting theory of surprise is applicable across different spatio-temporal scales, modalities, and levels of abstraction.

Citing Articles

P300 as an index of speech-in-noise understanding in complex acoustic environments in young and older adults.

Pearson D, Shen Y, Hetrick W, ODonnell B, Lundin N, McAuley J Front Neurosci. 2025; 19:1497781.

PMID: 40046437 PMC: 11879943. DOI: 10.3389/fnins.2025.1497781.


Brain network dynamics predict moments of surprise across contexts.

Zhang Z, Rosenberg M Nat Hum Behav. 2024; .

PMID: 39715875 DOI: 10.1038/s41562-024-02017-0.


Temporal dynamics of uncertainty and prediction error in musical improvisation across different periods.

Daikoku T Sci Rep. 2024; 14(1):22297.

PMID: 39333792 PMC: 11437158. DOI: 10.1038/s41598-024-73689-x.


Neurons of Macaque Frontal Eye Field Signal Reward-Related Surprise.

Shteyn M, Olson C J Neurosci. 2024; 44(38).

PMID: 39107059 PMC: 11411596. DOI: 10.1523/JNEUROSCI.0441-24.2024.


Model-Based Approaches to Investigating Mismatch Responses in Schizophrenia.

Gutlin D, McDermott H, Grundei M, Auksztulewicz R Clin EEG Neurosci. 2024; 56(1):8-21.

PMID: 38751125 PMC: 11664892. DOI: 10.1177/15500594241253910.


References
1.
Renninger L, Coughlan J, Verghese P, Malik J . An information maximization model of eye movements. Adv Neural Inf Process Syst. 2005; 17:1121-8. View

2.
Najemnik J, Geisler W . Optimal eye movement strategies in visual search. Nature. 2005; 434(7031):387-91. DOI: 10.1038/nature03390. View

3.
Li Z . A saliency map in primary visual cortex. Trends Cogn Sci. 2002; 6(1):9-16. DOI: 10.1016/s1364-6613(00)01817-9. View

4.
Hubel D, Wiesel T . Receptive fields, binocular interaction and functional architecture in the cat's visual cortex. J Physiol. 1962; 160:106-54. PMC: 1359523. DOI: 10.1113/jphysiol.1962.sp006837. View

5.
Mannan S, Ruddock K, Wooding D . The relationship between the locations of spatial features and those of fixations made during visual examination of briefly presented images. Spat Vis. 1996; 10(3):165-88. DOI: 10.1163/156856896x00123. View