» Articles » PMID: 20090771

Supervised Machine Learning and Logistic Regression Identifies Novel Epistatic Risk Factors with PTPN22 for Rheumatoid Arthritis

Overview
Journal Genes Immun
Date 2010 Jan 22
PMID 20090771
Citations 20
Authors
Affiliations
Soon will be listed here.
Abstract

Investigating genetic interactions (epistasis) has proven difficult despite the recent advances of both laboratory methods and statistical developments. With no 'best' statistical approach available, combining several analytical methods may be optimal for detecting epistatic interactions. Using a multi-stage analysis that incorporated supervised machine learning and methods of association testing, we investigated epistatic interactions with a well-established genetic factor (PTPN22 1858T) in a complex autoimmune disease (rheumatoid arthritis (RA)). Our analysis consisted of four principal stages: Stage I (data reduction)-identifying candidate chromosomal regions in 292 affected sibling pairs, by predicting PTPN22 concordance using multipoint identity-by-descent probabilities and a supervised machine learning algorithm (Random Forests); Stage II (extension analysis)-testing detailed genetic data within candidate chromosomal regions for epistasis with PTPN22 1858T in 677 cases and 750 controls using logistic regression; Stage III (replication analysis)-confirmation of epistatic interactions in 947 cases and 1756 controls; Stage IV (combined analysis)-a pooled analysis including all 1624 RA cases and 2506 control subjects for final estimates of effect size. A total of seven replicating epistatic interactions were identified. SNP variants within CDH13, MYO3A, CEP72 and near WFDC1 showed significant evidence for interaction with PTPN22, affecting susceptibility to RA.

Citing Articles

Scoping review: Machine learning interventions in the management of healthcare systems.

Arueyingho O, Al-Taie A, McCallum C Digit Health. 2024; 10:20552076221144095.

PMID: 39444734 PMC: 11497546. DOI: 10.1177/20552076221144095.


Distributed transformer for high order epistasis detection in large-scale datasets.

Graca M, Nobre R, Sousa L, Ilic A Sci Rep. 2024; 14(1):14579.

PMID: 38918413 PMC: 11199512. DOI: 10.1038/s41598-024-65317-5.


Prediction of Liver Enzyme Elevation Using Supervised Machine Learning in Patients With Rheumatoid Arthritis on Treatment with Methotrexate.

Surendran S, B M, Gilvaz V, Manyam P, Panicker K, Pradeep M Cureus. 2024; 16(1):e52110.

PMID: 38344615 PMC: 10858738. DOI: 10.7759/cureus.52110.


Artificial Intelligence in Rheumatoid Arthritis: Current Status and Future Perspectives: A State-of-the-Art Review.

Momtazmanesh S, Nowroozi A, Rezaei N Rheumatol Ther. 2022; 9(5):1249-1304.

PMID: 35849321 PMC: 9510088. DOI: 10.1007/s40744-022-00475-4.


Big data analyses and individual health profiling in the arena of rheumatic and musculoskeletal diseases (RMDs).

De Cock D, Myasoedova E, Aletaha D, Studenic P Ther Adv Musculoskelet Dis. 2022; 14:1759720X221105978.

PMID: 35794905 PMC: 9251966. DOI: 10.1177/1759720X221105978.


References
1.
Ma L, Dvorkin D, Garbe J, Da Y . Genome-wide analysis of single-locus and epistasis single-nucleotide polymorphism effects on anti-cyclic citrullinated peptide as a measure of rheumatoid arthritis. BMC Proc. 2008; 1 Suppl 1:S127. PMC: 2367477. DOI: 10.1186/1753-6561-1-s1-s127. View

2.
Philippova M, Ivanov D, Allenspach R, Takuwa Y, Erne P, Resink T . RhoA and Rac mediate endothelial cell polarization and detachment induced by T-cadherin. FASEB J. 2005; 19(6):588-90. DOI: 10.1096/fj.04-2430fje. View

3.
Musani S, Shriner D, Liu N, Feng R, Coffey C, Yi N . Detection of gene x gene interactions in genome-wide association studies of human population data. Hum Hered. 2007; 63(2):67-84. DOI: 10.1159/000099179. View

4.
Pritchard J, Stephens M, Rosenberg N, Donnelly P . Association mapping in structured populations. Am J Hum Genet. 2000; 67(1):170-81. PMC: 1287075. DOI: 10.1086/302959. View

5.
Criswell L, Pfeiffer K, Lum R, Gonzales B, Novitzke J, Kern M . Analysis of families in the multiple autoimmune disease genetics consortium (MADGC) collection: the PTPN22 620W allele associates with multiple autoimmune phenotypes. Am J Hum Genet. 2005; 76(4):561-71. PMC: 1199294. DOI: 10.1086/429096. View