
Forecasting Risk Gene Discovery in Autism with Machine Learning and Genome-scale Data

Overview
Journal Sci Rep
Specialty Science
Date 2020 Mar 14
PMID 32165711
Citations 19
Abstract

Genetics has been one of the most powerful windows into the biology of autism spectrum disorder (ASD). It is estimated that a thousand or more genes may confer risk for ASD when functionally perturbed; however, only around 100 genes currently have sufficient evidence to be considered true "autism risk genes". Massive genetic studies are currently underway, producing data to implicate additional genes. This approach, although necessary, is costly and slow-moving, making identification of putative ASD risk genes with existing data vital. Here, we approach autism risk gene discovery as a machine learning problem, rather than a genetic association problem, by using genome-scale data as predictors to identify new genes with similar properties to established autism risk genes. The resulting method, forecASD, integrates brain gene expression, heterogeneous network data, and previous gene-level predictors of autism association into an ensemble classifier that yields a single score indexing evidence of each gene's involvement in the etiology of autism. We demonstrate that forecASD has substantially better performance than previous predictors of autism association in three independent trio-based sequencing studies. Studying forecASD-prioritized genes, we show that forecASD is a robust indicator of a gene's involvement in ASD etiology, with diverse applications to gene discovery, differential expression analysis, eQTL prioritization, and pathway enrichment analysis.
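
As a rough illustration of the stacking idea described in the abstract, the sketch below trains one simple learner per data type and then a second-stage learner that combines their outputs into a single per-gene score. Everything in it is an assumption made for illustration: the toy data, the group names (`expression`, `network`, `prior_scores`), the helper functions, the balanced class labels, and the use of plain logistic regression are not taken from the paper and do not reproduce forecASD's actual models or features.

```python
import math
import random


def sigmoid(z):
    # Clamp the input so math.exp cannot overflow.
    z = max(-30.0, min(30.0, z))
    return 1.0 / (1.0 + math.exp(-z))


def fit_logistic(X, y, epochs=500, lr=0.3):
    """Fit logistic regression by batch gradient descent; returns (weights, bias)."""
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * d, 0.0
        for xi, yi in zip(X, y):
            err = sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b) - yi
            for j in range(d):
                grad_w[j] += err * xi[j]
            grad_b += err
        w = [wj - lr * g / n for wj, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b


def predict(model, X):
    w, b = model
    return [sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b) for xi in X]


def make_toy_genes(n=200, seed=0):
    """Hypothetical data: 'risk' genes (label 1) have features shifted upward."""
    rng = random.Random(seed)
    genes = []
    for i in range(n):
        label = i % 2  # balanced classes, purely for simplicity
        shift = float(label)
        genes.append({
            "expression": [rng.gauss(shift, 1.0) for _ in range(3)],
            "network": [rng.gauss(shift, 1.0) for _ in range(2)],
            "prior_scores": [rng.gauss(shift, 1.0)],
            "label": label,
        })
    return genes


def base_outputs(base, genes, groups):
    # One column of predicted probabilities per feature group.
    cols = [predict(base[name], [g[name] for g in genes]) for name in groups]
    return [list(row) for row in zip(*cols)]


def fit_stack(genes, groups):
    """Train one base learner per feature group, then a meta-learner on their outputs."""
    y = [g["label"] for g in genes]
    base = {name: fit_logistic([g[name] for g in genes], y) for name in groups}
    meta = fit_logistic(base_outputs(base, genes, groups), y)
    return base, meta


def score_genes(stack, genes, groups):
    """Return a single score in [0, 1] per gene from the stacked model."""
    base, meta = stack
    return predict(meta, base_outputs(base, genes, groups))


GROUPS = ["expression", "network", "prior_scores"]
genes = make_toy_genes()
train, held_out = genes[:100], genes[100:]
stack = fit_stack(train, GROUPS)
scores = score_genes(stack, held_out, GROUPS)
```

A fuller treatment would feed the second stage out-of-fold predictions from the first-stage learners rather than in-sample ones, to limit overfitting; that step is omitted here for brevity.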

Citing Articles

Proximity analysis of native proteomes reveals phenotypic modifiers in a mouse model of autism and related neurodevelopmental conditions.

Gao Y, Shonai D, Trn M, Zhao J, Soderblom E, Garcia-Moreno S. Nat Commun. 2024; 15(1):6801.

PMID: 39122707 PMC: 11316102. DOI: 10.1038/s41467-024-51037-x.


The Importance of Large-Scale Genomic Studies to Unravel Genetic Risk Factors for Autism.

Nobrega I, Teles E Silva A, Yokota-Moreno B, Sertie A. Int J Mol Sci. 2024; 25(11).

PMID: 38892002 PMC: 11172008. DOI: 10.3390/ijms25115816.


Graph Node Classification to Predict Autism Risk in Genes.

Bandara D, Riccardi K. Genes (Basel). 2024; 15(4).

PMID: 38674382 PMC: 11049455. DOI: 10.3390/genes15040447.


A network-based method for associating genes with autism spectrum disorder.

Zadok N, Ast G, Sharan R. Front Bioinform. 2024; 4:1295600.

PMID: 38525240 PMC: 10960359. DOI: 10.3389/fbinf.2024.1295600.


Integration of genome-scale data identifies candidate sleep regulators.

Lee Y, Endale M, Wu G, Ruben M, Francey L, Morris A. Sleep. 2022; 46(2).

PMID: 36462188 PMC: 9905783. DOI: 10.1093/sleep/zsac279.

