» Articles » PMID: 20233420

Cluster Analysis in Severe Emphysema Subjects Using Phenotype and Genotype Data: an Exploratory Investigation

Overview
Journal Respir Res
Specialty Pulmonary Medicine
Date 2010 Mar 18
PMID 20233420
Citations 30
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Numerous studies have demonstrated associations between genetic markers and COPD, but results have been inconsistent. One reason may be heterogeneity in disease definition. Unsupervised learning approaches may assist in understanding disease heterogeneity.

Methods: We selected 31 phenotypic variables and 12 SNPs from five candidate genes in 308 subjects in the National Emphysema Treatment Trial (NETT) Genetics Ancillary Study cohort. We used factor analysis to select a subset of phenotypic variables, and then used cluster analysis to identify subtypes of severe emphysema. We examined the phenotypic and genotypic characteristics of each cluster.

Results: We identified six factors accounting for 75% of the shared variability among our initial phenotypic variables. We selected four phenotypic variables from these factors for cluster analysis: 1) post-bronchodilator FEV1 percent predicted, 2) percent bronchodilator responsiveness, and quantitative CT measurements of 3) apical emphysema and 4) airway wall thickness. K-means cluster analysis revealed four clusters, though separation between clusters was modest: 1) emphysema predominant, 2) bronchodilator responsive, with higher FEV1; 3) discordant, with a lower FEV1 despite less severe emphysema and lower airway wall thickness, and 4) airway predominant. Of the genotypes examined, membership in cluster 1 (emphysema-predominant) was associated with TGFB1 SNP rs1800470.

Conclusions: Cluster analysis may identify meaningful disease subtypes and/or groups of related phenotypic variables even in a highly selected group of severe emphysema subjects, and may be useful for genetic association studies.

Citing Articles

Seasonality of acute kidney injury phenotypes in England: an unsupervised machine learning classification study of electronic health records.

Bolt H, Suffel A, Matthewman J, Sandmann F, Tomlinson L, Eggo R BMC Nephrol. 2023; 24(1):234.

PMID: 37558976 PMC: 10413486. DOI: 10.1186/s12882-023-03269-0.


Heterogeneity and Progression of Chronic Obstructive Pulmonary Disease: Emphysema-Predominant and Non-Emphysema-Predominant Disease.

Castaldi P, Xu Z, Young K, Hokanson J, Lynch D, Humphries S Am J Epidemiol. 2023; 192(10):1647-1658.

PMID: 37160347 PMC: 11063557. DOI: 10.1093/aje/kwad114.


Subtyping hospitalized patients with hypokalemia by machine learning consensus clustering and associated mortality risks.

Thongprayoon C, Mao M, Kattah A, Keddis M, Pattharanitima P, Erickson S Clin Kidney J. 2022; 15(2):253-261.

PMID: 35145640 PMC: 8825225. DOI: 10.1093/ckj/sfab190.


The value of bronchial and cavity contraction rates in differentiating benign and malignant pulmonary cavities.

Zhang H, Qian X, Liu Z, Gong Y BMC Pulm Med. 2020; 20(1):208.

PMID: 32762669 PMC: 7409678. DOI: 10.1186/s12890-020-01238-z.


Machine Learning Characterization of COPD Subtypes: Insights From the COPDGene Study.

Castaldi P, Boueiz A, Yun J, San Jose Estepar R, Ross J, Washko G Chest. 2019; 157(5):1147-1157.

PMID: 31887283 PMC: 7242638. DOI: 10.1016/j.chest.2019.11.039.


References
1.
Suthanthiran M, Li B, Song J, Ding R, Sharma V, Schwartz J . Transforming growth factor-beta 1 hyperexpression in African-American hypertensives: A novel mediator of hypertension and/or target organ damage. Proc Natl Acad Sci U S A. 2000; 97(7):3479-84. PMC: 16265. DOI: 10.1073/pnas.97.7.3479. View

2.
Castaldi P, Cho M, Cohn M, Langerman F, Moran S, Tarragona N . The COPD genetic association compendium: a comprehensive online database of COPD genetic associations. Hum Mol Genet. 2009; 19(3):526-34. PMC: 2798725. DOI: 10.1093/hmg/ddp519. View

3.
Taube C, Lehnigk B, Paasch K, Kirsten D, Jorres R, Magnussen H . Factor analysis of changes in dyspnea and lung function parameters after bronchodilation in chronic obstructive pulmonary disease. Am J Respir Crit Care Med. 2000; 162(1):216-20. DOI: 10.1164/ajrccm.162.1.9909054. View

4.
Sandford A, Chagani T, Weir T, Connett J, Anthonisen N, Pare P . Susceptibility genes for rapid decline of lung function in the lung health study. Am J Respir Crit Care Med. 2001; 163(2):469-73. DOI: 10.1164/ajrccm.163.2.2006158. View

5.
Pauwels R, Buist A, Calverley P, Jenkins C, Hurd S . Global strategy for the diagnosis, management, and prevention of chronic obstructive pulmonary disease. NHLBI/WHO Global Initiative for Chronic Obstructive Lung Disease (GOLD) Workshop summary. Am J Respir Crit Care Med. 2001; 163(5):1256-76. DOI: 10.1164/ajrccm.163.5.2101039. View