» Articles » PMID: 37619240

Starling: Introducing a Mesoscopic Scale with Confluence for Graph Clustering

Overview
Journal PLoS One
Date 2023 Aug 24
PMID 37619240
Authors
Affiliations
Soon will be listed here.
Abstract

Given a Graph G = (V, E) and two vertices i, j ∈ V, we introduce Confluence(G, i, j), a vertex mesoscopic closeness measure based on short Random walks, which brings together vertices from a same overconnected region of the Graph G, and separates vertices coming from two distinct overconnected regions. Confluence becomes a useful tool for defining a new Clustering quality function QConf(G, Γ) for a given Clustering Γ and for defining a new heuristic Starling to find a partitional Clustering of a Graph G intended to optimize the Clustering quality function QConf. We compare the accuracies of Starling, to the accuracies of three state of the art Graphs Clustering methods: Spectral-Clustering, Louvain, and Infomap. These comparisons are done, on the one hand with artificial Graphs (a) Random Graphs and (b) a classical Graphs Clustering Benchmark, and on the other hand with (c) Terrain-Graphs gathered from real data. We show that with (a), (b) and (c), Starling is always able to obtain equivalent or better accuracies than the three others methods. We show also that with the Benchmark (b), Starling is able to obtain equivalent accuracies and even sometimes better than an Oracle that would only know the expected overconnected regions from the Benchmark, ignoring the concretely constructed edges.

References
1.
Still S, Bialek W . How many clusters? An information-theoretic perspective. Neural Comput. 2004; 16(12):2483-506. DOI: 10.1162/0899766042321751. View

2.
Clauset A, Newman M, Moore C . Finding community structure in very large networks. Phys Rev E Stat Nonlin Soft Matter Phys. 2005; 70(6 Pt 2):066111. DOI: 10.1103/PhysRevE.70.066111. View

3.
Radicchi F, Castellano C, Cecconi F, Loreto V, Parisi D . Defining and identifying communities in networks. Proc Natl Acad Sci U S A. 2004; 101(9):2658-63. PMC: 365677. DOI: 10.1073/pnas.0400054101. View

4.
Lancichinetti A, Fortunato S, Radicchi F . Benchmark graphs for testing community detection algorithms. Phys Rev E Stat Nonlin Soft Matter Phys. 2008; 78(4 Pt 2):046110. DOI: 10.1103/PhysRevE.78.046110. View

5.
Newman M, Girvan M . Finding and evaluating community structure in networks. Phys Rev E Stat Nonlin Soft Matter Phys. 2004; 69(2 Pt 2):026113. DOI: 10.1103/PhysRevE.69.026113. View