» Articles » PMID: 38946953

Novel Machine Learning of DNA Methylation Patterns to Diagnose Complex Disease: Identification of Cerebral Palsy with Concurrent Epilepsy

Overview
Journal Res Sq
Date 2024 Jul 1
PMID 38946953
Authors
Affiliations
Soon will be listed here.
Abstract

Spastic cerebral palsy (CP) is a common pediatric-onset disability with an estimated prevalence of 0.2%. It is a complex condition characterized by muscle stiffness, contractures, and abnormal movement. Spastic CP is difficult to diagnose. Although nearly all affected children are born with it or acquire it immediately after birth, many are not identified until after 19 months of age with the diagnosis often not confirmed until 5 years of age. In addition, CP frequently co-occurs with other complex conditions that can complicate diagnosis and treatment. For example, an estimated 42% of spastic CP cases have co-occurring epilepsy. Recent studies indicate that altered DNA methylation patterns in peripheral blood cells are associated with CP and may have diagnostic value.Accordingly, the purpose of this study is to assess the diagnostic value of methylation in CP with more complex disease states. We evaluated machine learning classification for detecting CP based on DNA methylation pattern analysis in the context of co-occurrent epilepsy. Blood samples from 30 study participants diagnosed with epilepsy (n=4), spastic CP (n=10), both (n=8), or neither (n=8) were analyzed by Illumina MethylationEpic arrays. A novel machine learning algorithm using a Support Vector Machine (SVM) or Linear Discriminant Analysis (LDA) was developed to identify methylation loci that classified CP from controls and to measure the classification ability of identified methylation loci. The isolation of informative methylation loci was performed in a binary comparison between CP and controls, as well as in a 4-way comparison that included epilepsy. Median F1 scores for SVM-based analysis were 0.67 in 4-class comparison, and 1.0 in the binary classification. SVM outperformed LDA (median F1 0.57 and 0.86, respectively). Overall, the novel machine learning based algorithm was able to classify study participants with spastic CP and/or epilepsy from controls with significant performance.

References
1.
Leek J, Johnson W, Parker H, Jaffe A, Storey J . The sva package for removing batch effects and other unwanted variation in high-throughput experiments. Bioinformatics. 2012; 28(6):882-3. PMC: 3307112. DOI: 10.1093/bioinformatics/bts034. View

2.
Aryee M, Jaffe A, Corrada-Bravo H, Ladd-Acosta C, Feinberg A, Hansen K . Minfi: a flexible and comprehensive Bioconductor package for the analysis of Infinium DNA methylation microarrays. Bioinformatics. 2014; 30(10):1363-9. PMC: 4016708. DOI: 10.1093/bioinformatics/btu049. View

3.
Crowgey E, Marsh A, Robinson K, Yeager S, Akins R . Epigenetic machine learning: utilizing DNA methylation patterns to predict spastic cerebral palsy. BMC Bioinformatics. 2018; 19(1):225. PMC: 6011336. DOI: 10.1186/s12859-018-2224-0. View

4.
Mohandas N, Bass-Stringer S, Maksimovic J, Crompton K, Loke Y, Walstab J . Epigenome-wide analysis in newborn blood spots from monozygotic twins discordant for cerebral palsy reveals consistent regional differences in DNA methylation. Clin Epigenetics. 2018; 10:25. PMC: 5824607. DOI: 10.1186/s13148-018-0457-4. View

5.
Chen A, Dyck Holzinger S, Oskoui M, Shevell M . Losing a diagnosis of cerebral palsy: a comparison of variables at 2 and 5 years. Dev Med Child Neurol. 2019; 62(1):83-88. DOI: 10.1111/dmcn.14309. View