Identification of Subtypes in Subjects with Mild-to-moderate Airflow Limitation and Its Clinical and Socioeconomic Implications
Overview
Authors
Affiliations
Purpose: The purpose of this study was to identify subtypes in patients with mild-to-moderate airflow limitation and to appreciate their clinical and socioeconomic implications.
Methods: Subjects who were aged ≥20 years and had forced expiratory volume in 1 second (FEV) ≥60% predicted and FEV/forced vital capacity <0.7 were selected from the fourth Korea National Health and Nutrition Examination Survey (KNHANES) in 2007-2012. The data were merged to the National Health Insurance reimbursement database during the same period. k-Means clustering was performed to explore subtypes. For clustering analysis, six key input variables - age, body mass index (BMI), FEV% predicted, the presence or absence of self-reported wheezing, smoking status, and pack-years of smoking - were selected.
Results: Among a total of 2,140 subjects, five groups were identified through k-means clustering, namely putative "near-normal (n=232)," "asthmatic (n=392)," "chronic obstructive pulmonary disease (COPD) (n=37)," "asthmatic-overlap (n=893)," and "COPD-overlap (n=586)" subtypes. Near-normal group showed the oldest mean age (72±7 years) and highest FEV (102%±8% predicted), and asthmatic group was the youngest (46±9 years). COPD and COPD-overlap groups were male predominant and all current or ex-smokers. While asthmatic group had the lowest prescription rate despite the highest proportion of self-reported wheezing, COPD, asthmatic-overlap, and COPD-overlap groups showed high prescription rate of respiratory medicine. Although COPD group formed only 1.7% of total subjects, they showed the highest mean medical cost and health care utilization, comprising 5.3% of the total medical cost. When calculating a ratio of total medical expense to household income, the mean ratio was highest in the COPD group.
Conclusion: Clinical and epidemiological heterogeneities of subjects with mild-to-moderate airflow limitation and a different level of health care utilization by each subtype are shown. Identification of a subtype with high health care demand could be a priority for effective utilization of limited resources.
Jo Y Tuberc Respir Dis (Seoul). 2021; 85(2):101-110.
PMID: 34871477 PMC: 8987665. DOI: 10.4046/trd.2021.0140.
A Systematic Review of Asthma Phenotypes Derived by Data-Driven Methods.
Cunha F, Amaral R, Jacinto T, Sousa-Pinto B, Fonseca J Diagnostics (Basel). 2021; 11(4).
PMID: 33918233 PMC: 8066118. DOI: 10.3390/diagnostics11040644.
Simulation-derived best practices for clustering clinical data.
Coombes C, Liu X, Abrams Z, Coombes K, Brock G J Biomed Inform. 2021; 118:103788.
PMID: 33862229 PMC: 9017600. DOI: 10.1016/j.jbi.2021.103788.
Unsupervised machine learning and prognostic factors of survival in chronic lymphocytic leukemia.
Coombes C, Abrams Z, Li S, Abruzzo L, Coombes K J Am Med Inform Assoc. 2020; 27(7):1019-1027.
PMID: 32483590 PMC: 7647286. DOI: 10.1093/jamia/ocaa060.
Nissen F, Douglas I, Mullerova H, Pearce N, Bloom C, Smeeth L J Asthma Allergy. 2019; 12:7-19.
PMID: 30662273 PMC: 6329349. DOI: 10.2147/JAA.S182013.