Of Bits and Wows: A Bayesian Theory of Surprise with Applications to Attention

Overview

Journal: Neural Netw

Specialties: Biology, Neurology

Date: 2010 Jan 19

PMID: 20080025

Citations: 62

Authors

Pierre Baldi

Laurent Itti

Abstract

The amount of information contained in a piece of data can be measured by the effect this data has on its observer. Fundamentally, this effect is to transform the observer's prior beliefs into posterior beliefs, according to Bayes' theorem. Thus the amount of information can be measured in a natural way by the distance (relative entropy) between the prior and posterior distributions of the observer over the available space of hypotheses. This facet of information, termed "surprise", is important in dynamic situations where beliefs change, in particular during learning and adaptation. Surprise can often be computed analytically, for instance in the case of distributions from the exponential family, or it can be numerically approximated. During sequential Bayesian learning, surprise decreases as the inverse of the number of training examples. Theoretical properties of surprise are discussed, in particular how it differs from and complements Shannon's definition of information. A computer vision neural network architecture capable of computing surprise over images and video stimuli is then presented. Hypothesizing that surprising data ought to attract natural or artificial attention systems, the output of this architecture is used in a psychophysical experiment to analyze human eye movements in the presence of natural video stimuli. Surprise is found to yield robust performance at predicting human gaze (ROC-like ordinal dominance score of approximately 0.7, compared to approximately 0.8 for human inter-observer repeatability, approximately 0.6 for a simpler intensity contrast-based predictor, and 0.5 for chance). The resulting theory of surprise is applicable across different spatio-temporal scales, modalities, and levels of abstraction.
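The definition in the abstract can be illustrated with a minimal sketch, under stated assumptions. The observer's hypothesis is taken to be the unknown mean of a Gaussian with known noise variance (a conjugate, exponential-family case), and surprise is computed as the relative entropy KL(posterior || prior). The abstract does not fix the direction of the relative entropy, so that choice is an assumption here, as are the function names kl_gaussian and sequential_surprise and the toy data.

```python
import math


def kl_gaussian(mean_p, var_p, mean_q, var_q):
    """Relative entropy KL(p || q), in nats, between two 1-D Gaussians."""
    return 0.5 * (math.log(var_q / var_p)
                  + (var_p + (mean_p - mean_q) ** 2) / var_q
                  - 1.0)


def sequential_surprise(data, prior_mean, prior_var, noise_var):
    """Surprise elicited by each datum during sequential Bayesian learning.

    The belief over the unknown mean is Gaussian. Each datum turns the
    current belief (the prior) into a posterior via the conjugate update,
    and the surprise is KL(posterior || prior). The posterior then serves
    as the prior for the next datum.
    """
    mean, var = prior_mean, prior_var
    surprises = []
    for x in data:
        # Conjugate update: precisions add, means combine precision-weighted.
        post_var = 1.0 / (1.0 / var + 1.0 / noise_var)
        post_mean = post_var * (mean / var + x / noise_var)
        surprises.append(kl_gaussian(post_mean, post_var, mean, var))
        mean, var = post_mean, post_var
    return surprises


if __name__ == "__main__":
    # Hypothetical toy data: values alternating one noise standard
    # deviation above and below zero.
    data = [1.0 if i % 2 == 0 else -1.0 for i in range(200)]
    s = sequential_surprise(data, prior_mean=0.0, prior_var=1.0, noise_var=1.0)
    for n in (1, 10, 100):
        # n * surprise staying roughly constant indicates 1/n decay.
        print(n, s[n - 1], n * s[n - 1])
```

For this toy sequence the product of the example count and the surprise settles near a constant, which is consistent with the inverse decay stated in the abstract. Dividing the result by math.log(2) converts it from nats to base-2 units.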

Citing Articles

P300 as an index of speech-in-noise understanding in complex acoustic environments in young and older adults.

Pearson D, Shen Y, Hetrick W, O'Donnell B, Lundin N, McAuley J Front Neurosci. 2025; 19:1497781.

PMID: 40046437 PMC: 11879943. DOI: 10.3389/fnins.2025.1497781.

Brain network dynamics predict moments of surprise across contexts.

Zhang Z, Rosenberg M Nat Hum Behav. 2024.

PMID: 39715875 DOI: 10.1038/s41562-024-02017-0.

Temporal dynamics of uncertainty and prediction error in musical improvisation across different periods.

Daikoku T Sci Rep. 2024; 14(1):22297.

PMID: 39333792 PMC: 11437158. DOI: 10.1038/s41598-024-73689-x.

Neurons of Macaque Frontal Eye Field Signal Reward-Related Surprise.

Shteyn M, Olson C J Neurosci. 2024; 44(38).

PMID: 39107059 PMC: 11411596. DOI: 10.1523/JNEUROSCI.0441-24.2024.

Model-Based Approaches to Investigating Mismatch Responses in Schizophrenia.

Gutlin D, McDermott H, Grundei M, Auksztulewicz R Clin EEG Neurosci. 2024; 56(1):8-21.

PMID: 38751125 PMC: 11664892. DOI: 10.1177/15500594241253910.
