» Articles » PMID: 38685479

Novel Machine Learning Identifies 5 Asthma Phenotypes Using Cluster Analysis of Real-World Data

Abstract

Background: Asthma classification into different subphenotypes is important to guide personalized therapy and improve outcomes.

Objectives: To further explore asthma heterogeneity through determination of multiple patient groups by using novel machine learning (ML) approaches and large-scale real-world data.

Methods: We used electronic health records of patients with asthma followed at the Cleveland Clinic between 2010 and 2021. We used k-prototype unsupervised ML to develop a clustering model where predictors were age, sex, race, body mass index, prebronchodilator and postbronchodilator spirometry measurements, and the usage of inhaled/systemic steroids. We applied elbow and silhouette plots to select the optimal number of clusters. These clusters were then evaluated through LightGBM's supervised ML approach on their cross-validated F1 score to support their distinctiveness.

Results: Data from 13,498 patients with asthma with available postbronchodilator spirometry measurements were extracted to identify 5 stable clusters. Cluster 1 included a young nonsevere asthma population with normal lung function and higher frequency of acute exacerbation (0.8 /patient-year). Cluster 2 had the highest body mass index (mean ± SD, 44.44 ± 7.83 kg/m), and the highest proportion of females (77.5%) and Blacks (28.9%). Cluster 3 comprised patients with normal lung function. Cluster 4 included patients with lower percent of predicted FEV of 77.03 (12.79) and poor response to bronchodilators. Cluster 5 had the lowest percent of predicted FEV of 68.08 (15.02), the highest postbronchodilator reversibility, and the highest proportion of severe asthma (44.9%) and blood eosinophilia (>300 cells/μL) (34.8%).

Conclusions: Using real-world data and unsupervised ML, we classified asthma into 5 clinically important subphenotypes where group-specific asthma treatment and management strategies can be designed and deployed.

References
1.
Zein J, Wu C, Attaway A, Zhang P, Nazha A . Novel Machine Learning Can Predict Acute Asthma Exacerbation. Chest. 2021; 159(5):1747-1757. PMC: 8129731. DOI: 10.1016/j.chest.2020.12.051. View

2.
Loureiro C, Sa-Couto P, Todo-Bom A, Bousquet J . Cluster analysis in phenotyping a Portuguese population. Rev Port Pneumol (2006). 2015; . DOI: 10.1016/j.rppnen.2015.07.006. View

3.
Zhang P, Zein J . Novel Insights on Sex-Related Differences in Asthma. Curr Allergy Asthma Rep. 2019; 19(10):44. DOI: 10.1007/s11882-019-0878-y. View

4.
Ortega H, Liu M, Pavord I, Brusselle G, FitzGerald J, Chetta A . Mepolizumab treatment in patients with severe eosinophilic asthma. N Engl J Med. 2014; 371(13):1198-207. DOI: 10.1056/NEJMoa1403290. View

5.
Haldar P, Pavord I, Shaw D, Berry M, Thomas M, Brightling C . Cluster analysis and clinical asthma phenotypes. Am J Respir Crit Care Med. 2008; 178(3):218-224. PMC: 3992366. DOI: 10.1164/rccm.200711-1754OC. View