Accuracy and Scaling Phenomena in Internet Mapping
Overview
Authors
Affiliations
It was recently argued that sampling a network by traversing it with paths from a small number of sources, as with traceroutes on the Internet, creates a fundamental bias in observed topological features like the degree distribution. We examine this bias analytically and experimentally. For Erdo s-Re nyi random graphs with mean degree c, we show analytically that such sampling gives an observed degree distribution P(k) approximately k(-1) for k less, similarc, despite the underlying distribution being Poissonian. For graphs whose degree distributions have power-law tails P(k) approximately k(-alpha), sampling can significantly underestimate alpha when the graph has a large excess (i.e., many more edges than vertices). We find that in order to accurately estimate alpha, one must use a number of sources which grows linearly in the mean degree of the underlying graph. Finally, we comment on the accuracy of the published values of alpha for the Internet.
Optimization of multiple sampling for solving network boundary specification problem.
Zhang R Sci Rep. 2025; 15(1):4221.
PMID: 39905150 PMC: 11794470. DOI: 10.1038/s41598-025-87760-8.
Measurement error of network clustering coefficients under randomly missing nodes.
Nakajima K, Shudo K Sci Rep. 2021; 11(1):2815.
PMID: 33568743 PMC: 7876024. DOI: 10.1038/s41598-021-82367-1.
Spreading paths in partially observed social networks.
Onnela J, Christakis N Phys Rev E Stat Nonlin Soft Matter Phys. 2012; 85(3 Pt 2):036106.
PMID: 22587148 PMC: 3544412. DOI: 10.1103/PhysRevE.85.036106.
Su X, Jin X, Min Y, Mo L, Yang J PLoS One. 2011; 6(5):e19784.
PMID: 21611192 PMC: 3096638. DOI: 10.1371/journal.pone.0019784.
A model of Internet topology using k-shell decomposition.
Carmi S, Havlin S, Kirkpatrick S, Shavitt Y, Shir E Proc Natl Acad Sci U S A. 2007; 104(27):11150-4.
PMID: 17586683 PMC: 1896135. DOI: 10.1073/pnas.0701175104.