» Articles » PMID: 14757820

Tracking Evolving Communities in Large Linked Networks

Overview
Specialty Science
Date 2004 Feb 6
PMID 14757820
Citations 19
Authors
Affiliations
Soon will be listed here.
Abstract

We are interested in tracking changes in large-scale data by periodically creating an agglomerative clustering and examining the evolution of clusters (communities) over time. We examine a large real-world data set: the NEC CiteSeer database, a linked network of >250,000 papers. Tracking changes over time requires a clustering algorithm that produces clusters stable under small perturbations of the input data. However, small perturbations of the CiteSeer data lead to significant changes to most of the clusters. One reason for this is that the order in which papers within communities are combined is somewhat arbitrary. However, certain subsets of papers, called natural communities, correspond to real structure in the CiteSeer database and thus appear in any clustering. By identifying the subset of clusters that remain stable under multiple clustering runs, we get the set of natural communities that we can track over time. We demonstrate that such natural communities allow us to identify emerging communities and track temporal changes in the underlying structure of our network data.

Citing Articles

Network analysis of pig movement data as an epidemiological tool: an Austrian case study.

Puspitarani G, Fuchs R, Fuchs K, Ladinig A, Desvars-Larrive A Sci Rep. 2023; 13(1):9623.

PMID: 37316653 PMC: 10267221. DOI: 10.1038/s41598-023-36596-1.


Correlation networks of air particulate matter ( ): a comparative study.

Vlachogiannis D, Xu Y, Jin L, Gonzalez M Appl Netw Sci. 2021; 6(1):32.

PMID: 33907706 PMC: 8062950. DOI: 10.1007/s41109-021-00373-8.


TriRNSC: triclustering of gene expression microarray data using restricted neighbourhood search.

Biswal B, Patra S, Mohapatra A, Vipsita S IET Syst Biol. 2021; 14(6):323-333.

PMID: 33399096 PMC: 8687346. DOI: 10.1049/iet-syb.2020.0024.


Analysis of group evolution prediction in complex networks.

Saganowski S, Brodka P, Koziarski M, Kazienko P PLoS One. 2019; 14(10):e0224194.

PMID: 31661495 PMC: 6818769. DOI: 10.1371/journal.pone.0224194.


Tracking online topics over time: understanding dynamic hashtag communities.

Lorenz-Spreen P, Wolf F, Braun J, Ghoshal G, Djurdjevac Conrad N, Hovel P Comput Soc Netw. 2018; 5(1):9.

PMID: 30416936 PMC: 6208799. DOI: 10.1186/s40649-018-0058-6.


References
1.
Watts D, Strogatz S . Collective dynamics of 'small-world' networks. Nature. 1998; 393(6684):440-2. DOI: 10.1038/30918. View

2.
Borner K, Maru J, Goldstone R . The simultaneous evolution of author and paper networks. Proc Natl Acad Sci U S A. 2004; 101 Suppl 1:5266-73. PMC: 387306. DOI: 10.1073/pnas.0307625100. View