Topological Data Analysis Identifies Molecular Phenotypes of Idiopathic Pulmonary Fibrosis

Overview
Journal Thorax
Date 2023 Feb 22
PMID 36808085
Abstract

Background: Idiopathic pulmonary fibrosis (IPF) is a debilitating, progressive disease with a median survival time of 3-5 years. Diagnosis remains challenging and disease progression varies greatly, suggesting the possibility of distinct subphenotypes.

Methods and Results: We analysed publicly available peripheral blood mononuclear cell expression datasets for 219 IPF, 411 asthma, 362 tuberculosis, 151 healthy, 92 HIV and 83 other disease samples, totalling 1318 patients. We integrated the datasets and split them into train (n=871) and test (n=477) cohorts to investigate the utility of a machine learning model (support vector machine) for predicting IPF. A panel of 44 genes predicted IPF against a background of healthy, tuberculosis, HIV and asthma samples with an area under the curve of 0.9464, corresponding to a sensitivity of 0.865 and a specificity of 0.89. We then applied topological data analysis to investigate the possibility of subphenotypes within IPF. We identified five molecular subphenotypes of IPF, one of which corresponded to a phenotype enriched for death/transplant. The subphenotypes were molecularly characterised using bioinformatic and pathway analysis tools, which identified distinct subphenotype features, including one that suggests an extrapulmonary or systemic fibrotic disease.

Conclusions: Integration of multiple datasets from the same tissue enabled the development of a model that accurately predicts IPF using a panel of 44 genes. Furthermore, topological data analysis identified distinct subphenotypes of patients with IPF, defined by differences in molecular pathobiology and clinical characteristics.
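
The abstract reports classifier performance as an area under the curve together with a sensitivity and a specificity. As a minimal sketch of how these three quantities relate to a classifier's output scores, the Python below computes them for a toy set of labels and scores. The data, the 0.5 threshold and the function names `sensitivity_specificity` and `roc_auc` are illustrative assumptions; none of them come from the study, and this is not the authors' pipeline.

```python
def sensitivity_specificity(labels, scores, threshold):
    """Return (sensitivity, specificity) at a given decision threshold.

    labels: 1 for the positive class (here, hypothetically, IPF), 0 otherwise.
    scores: classifier outputs; a score >= threshold is called positive.
    """
    pairs = list(zip(labels, scores))
    tp = sum(1 for y, s in pairs if y == 1 and s >= threshold)
    fn = sum(1 for y, s in pairs if y == 1 and s < threshold)
    tn = sum(1 for y, s in pairs if y == 0 and s < threshold)
    fp = sum(1 for y, s in pairs if y == 0 and s >= threshold)
    return tp / (tp + fn), tn / (tn + fp)


def roc_auc(labels, scores):
    """Area under the ROC curve via its rank interpretation.

    The AUC equals the probability that a randomly chosen positive sample
    scores higher than a randomly chosen negative one (ties count as half).
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy example only: 4 positives and 6 negatives with made-up scores.
labels = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
scores = [0.9, 0.8, 0.6, 0.3, 0.7, 0.4, 0.35, 0.2, 0.1, 0.05]

sens, spec = sensitivity_specificity(labels, scores, threshold=0.5)
auc = roc_auc(labels, scores)
print(f"sensitivity={sens:.3f} specificity={spec:.3f} AUC={auc:.3f}")
```

Sensitivity and specificity depend on the chosen threshold, whereas the AUC summarises ranking quality across all thresholds, which is why a single AUC can correspond to many sensitivity/specificity pairs.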
