» Articles » PMID: 15244698

Estimating Mutual Information

Overview
Date 2004 Jul 13
PMID 15244698
Citations 510
Authors
Affiliations
Soon will be listed here.
Abstract

We present two classes of improved estimators for mutual information M(X,Y), from samples of random points distributed according to some joint probability density mu(x,y). In contrast to conventional estimators based on binnings, they are based on entropy estimates from k -nearest neighbor distances. This means that they are data efficient (with k=1 we resolve structures down to the smallest possible scales), adaptive (the resolution is higher where data are more numerous), and have minimal bias. Indeed, the bias of the underlying entropy estimates is mainly due to nonuniformity of the density at the smallest resolved scale, giving typically systematic errors which scale as functions of k/N for N points. Numerically, we find that both families become exact for independent distributions, i.e. the estimator M(X,Y) vanishes (up to statistical fluctuations) if mu(x,y)=mu(x)mu(y). This holds for all tested marginal distributions and for all dimensions of x and y. In addition, we give estimators for redundancies between more than two random variables. We compare our algorithms in detail with existing algorithms. Finally, we demonstrate the usefulness of our estimators for assessing the actual independence of components obtained from independent component analysis (ICA), for improving ICA, and for estimating the reliability of blind source separation.

Citing Articles

An explainable GeoAI approach for the multimodal analysis of urban human dynamics: a case study for the COVID-19 pandemic in Rio de Janeiro.

Hanny D, Arifi D, Knoblauch S, Resch B, Lautenbach S, Zipf A Comput Urban Sci. 2025; 5(1):13.

PMID: 40046777 PMC: 11876275. DOI: 10.1007/s43762-025-00172-2.


Optimizing functional brain network analysis by incorporating nonlinear factors and frequency band selection with machine learning models.

Hu K, Zhong B, Tian R, Yao J Medicine (Baltimore). 2025; 104(9):e41667.

PMID: 40020107 PMC: 11875576. DOI: 10.1097/MD.0000000000041667.


Stacking Ensemble Deep Learning for Real-Time Intrusion Detection in IoMT Environments.

Alalwany E, Alsharif B, Alotaibi Y, Alfahaid A, Mahgoub I, Ilyas M Sensors (Basel). 2025; 25(3).

PMID: 39943263 PMC: 11821146. DOI: 10.3390/s25030624.


Self-driving lab for the photochemical synthesis of plasmonic nanoparticles with targeted structural and optical properties.

Wu T, Kheiri S, Hickman R, Tao H, Wu T, Yang Z Nat Commun. 2025; 16(1):1473.

PMID: 39922810 PMC: 11807174. DOI: 10.1038/s41467-025-56788-9.


Classification of short-term flood events using stochastic variable selection and Gaussian Naïve Bayes classifier: A case study of Sirajganj district, Bangladesh.

Mondal C, Uddin M Heliyon. 2025; 11(2):e41941.

PMID: 39897862 PMC: 11787522. DOI: 10.1016/j.heliyon.2025.e41941.