» Articles » PMID: 37966428

Euler Characteristic Curves and Profiles: a Stable Shape Invariant for Big Data Problems

Overview
Journal Gigascience
Specialties Biology
Genetics
Date 2023 Nov 15
PMID 37966428
Authors
Affiliations
Soon will be listed here.
Abstract

Tools of topological data analysis provide stable summaries encapsulating the shape of the considered data. Persistent homology, the most standard and well-studied data summary, suffers a number of limitations; its computations are hard to distribute, and it is hard to generalize to multifiltrations and is computationally prohibitive for big datasets. In this article, we study the concept of Euler characteristics curves for 1-parameter filtrations and Euler characteristic profiles for multiparameter filtrations. While being a weaker invariant in one dimension, we show that Euler characteristic-based approaches do not possess some handicaps of persistent homology; we show efficient algorithms to compute them in a distributed way, their generalization to multifiltrations, and practical applicability for big data problems. In addition, we show that the Euler curves and profiles enjoy a certain type of stability, which makes them robust tools for data analysis. Lastly, to show their practical applicability, multiple use cases are considered.

Citing Articles

Euler characteristic curves and profiles: a stable shape invariant for big data problems.

Dlotko P, Gurnari D Gigascience. 2023; 12.

PMID: 37966428 PMC: 10646871. DOI: 10.1093/gigascience/giad094.

References
1.
Vipond O, Bull J, Macklin P, Tillmann U, Pugh C, Byrne H . Multiparameter persistent homology landscapes identify immune cell spatial patterns in tumors. Proc Natl Acad Sci U S A. 2021; 118(41). PMC: 8522280. DOI: 10.1073/pnas.2102166118. View

2.
Hiraoka Y, Nakamura T, Hirata A, Escolar E, Matsue K, Nishiura Y . Hierarchical structures of amorphous solids characterized by persistent homology. Proc Natl Acad Sci U S A. 2016; 113(26):7035-40. PMC: 4932931. DOI: 10.1073/pnas.1520877113. View

3.
Nicolau M, Levine A, Carlsson G . Topology based data analysis identifies a subgroup of breast cancers with a unique mutational profile and excellent survival. Proc Natl Acad Sci U S A. 2011; 108(17):7265-70. PMC: 3084136. DOI: 10.1073/pnas.1102826108. View

4.
Lawson P, Sholl A, Brown J, Fasy B, Wenk C . Persistent Homology for the Quantitative Evaluation of Architectural Features in Prostate Cancer Histology. Sci Rep. 2019; 9(1):1139. PMC: 6361896. DOI: 10.1038/s41598-018-36798-y. View

5.
Chevyrev I, Nanda V, Oberhauser H . Persistence Paths and Signature Features in Topological Data Analysis. IEEE Trans Pattern Anal Mach Intell. 2018; 42(1):192-202. DOI: 10.1109/TPAMI.2018.2885516. View