» Articles » PMID: 33002337

Data-Driven Patient Clustering and Differential Clinical Outcomes in the Brigham and Women's Rheumatoid Arthritis Sequential Study Registry

Abstract

Objective: To use unbiased, data-driven, principal component (PC) and cluster analysis to identify patient phenotypes of rheumatoid arthritis (RA) that might exhibit distinct trajectories of disease progression, response to treatment, and risk for adverse events.

Methods: Patient demographic, socioeconomic, health, and disease characteristics recorded at entry into a large, single-center, prospective observational registry cohort, the Brigham and Women's Rheumatoid Arthritis Sequential Study (BRASS), were harmonized using PC analysis to reduce dimensionality and collinearity. The number of PCs was established by eigenvalue >1, cumulative variance, and interpretability. The resulting PCs were used to cluster patients using a K-means approach. Longitudinal clinical outcomes were compared between the clusters over 2 years.

Results: Analysis of 142 variables from 1,443 patients identified 41 PCs that accounted for 77% of the cumulative variance in the data set. Cluster analysis distinguished 5 patient clusters: 1) less RA disease activity/multimorbidity, shorter RA duration, lower incidence of comorbidities; 2) less RA disease activity/multimorbidity, longer RA duration, more infections, psychiatric comorbidities, health care utilization; 3) moderate RA disease activity/multimorbidity, more neurologic comorbidity; 4) more RA disease activity/multimorbidity, shorter RA duration, more metabolic comorbidity, higher body mass index; 5) more RA disease activity/multimorbidity, longer RA duration, more hepatic, orthopedic comorbidity and RA-related surgeries. The clusters exhibited differences in clinical outcomes over 2 years of follow-up.

Conclusion: Data-driven analysis of the BRASS registry identified 5 distinct phenotypes of RA. These results illustrate the potential of data-driven patient profiling as a tool to support personalized medicine in RA. Validation in an independent data set is ongoing.

Citing Articles

A robust clustering strategy for stratification unveils unique patient subgroups in acutely decompensated cirrhosis.

Palomino-Echeverria S, Huergo E, Ortega-Legarreta A, Uson Raposo E, Aguilar F, de la Pena-Ramirez C J Transl Med. 2024; 22(1):599.

PMID: 38937846 PMC: 11210156. DOI: 10.1186/s12967-024-05386-2.


Comorbidity clusters in patients with rheumatoid arthritis identify a patient phenotype with a favourable prognosis.

Crowson C, Atkinson E, Kronzer V, Kimbrough B, Arment C, Peterson L Ann Rheum Dis. 2024; 83(5):556-563.

PMID: 38331589 PMC: 11017091. DOI: 10.1136/ard-2023-225093.


Assessing clusters of comorbidities in rheumatoid arthritis: a machine learning approach.

Solomon D, Guan H, Johansson F, Santacroce L, Malley W, Guo L Arthritis Res Ther. 2023; 25(1):224.

PMID: 37993918 PMC: 10664370. DOI: 10.1186/s13075-023-03191-8.


Patterns and determinants of response to novel therapies in juvenile and adult-onset polyarthritis.

Triaille C, Quartier P, De Somer L, Durez P, Lauwerys B, Verschueren P Rheumatology (Oxford). 2023; 63(3):594-607.

PMID: 37725352 PMC: 10907821. DOI: 10.1093/rheumatology/kead490.


Patient groups in Rheumatoid arthritis identified by deep learning respond differently to biologic or targeted synthetic DMARDs.

Kalweit M, Burden A, Boedecker J, Hugle T, Burkard T PLoS Comput Biol. 2023; 19(6):e1011073.

PMID: 37267387 PMC: 10266686. DOI: 10.1371/journal.pcbi.1011073.


References
1.
Dougados M, Soubrier M, Antunez A, Balint P, Balsa A, Buch M . Prevalence of comorbidities in rheumatoid arthritis and evaluation of their monitoring: results of an international, cross-sectional study (COMORA). Ann Rheum Dis. 2013; 73(1):62-8. PMC: 3888623. DOI: 10.1136/annrheumdis-2013-204223. View

2.
Orange D, Agius P, DiCarlo E, Robine N, Geiger H, Szymonifka J . Identification of Three Rheumatoid Arthritis Disease Subtypes by Machine Learning Integration of Synovial Histologic Features and RNA Sequencing Data. Arthritis Rheumatol. 2018; 70(5):690-701. PMC: 6336443. DOI: 10.1002/art.40428. View

3.
Chin C, Hsieh S, Tseng V . eDRAM: Effective early disease risk assessment with matrix factorization on a large-scale medical database: A case study on rheumatoid arthritis. PLoS One. 2018; 13(11):e0207579. PMC: 6261027. DOI: 10.1371/journal.pone.0207579. View

4.
Basile A, Ritchie M . Informatics and machine learning to define the phenotype. Expert Rev Mol Diagn. 2018; 18(3):219-226. PMC: 6080627. DOI: 10.1080/14737159.2018.1439380. View

5.
Lin C, Karlson E, Canhao H, Miller T, Dligach D, Chen P . Automatic prediction of rheumatoid arthritis disease activity from the electronic medical records. PLoS One. 2013; 8(8):e69932. PMC: 3745469. DOI: 10.1371/journal.pone.0069932. View