Identification of Discriminative Gene-level and Protein-level Features Associated with Pathogenic Gain-of-function and Loss-of-function Variants

Overview
Journal Am J Hum Genet
Publisher Cell Press
Specialty Genetics
Date 2021 Nov 11
PMID 34762822
Citations 16
Abstract

Identifying whether a given genetic mutation results in a gene product with increased (gain-of-function; GOF) or diminished (loss-of-function; LOF) activity is an important step toward understanding disease mechanisms, because GOF and LOF variants may result in markedly different clinical phenotypes. Here, we generated an extensive database of documented germline GOF and LOF pathogenic variants by applying natural language processing (NLP) to the available abstracts in the Human Gene Mutation Database. We then investigated various gene- and protein-level features of GOF and LOF variants and applied machine learning and statistical analyses to identify discriminative features. We found that GOF variants were enriched in essential genes, for autosomal-dominant inheritance, and in protein binding and interaction domains, whereas LOF variants were enriched in singleton genes, for protein-truncating variants, and in protein core regions. We developed a user-friendly web-based interface that enables the extraction of selected subsets from the GOF/LOF database by a broad set of annotated features and the downloading of up-to-date versions. These results improve our understanding of how variants affect gene/protein function and may ultimately guide future treatment options.

Citing Articles

Ligand distances as key predictors of pathogenicity and function in NMDA receptors.

Montanucci L, Brunger T, Bhattarai N, Bosselmann C, Kim S, Allen J. Hum Mol Genet. 2024; 34(2):128-139.

PMID: 39535073 PMC: 11780861. DOI: 10.1093/hmg/ddae156.


Proteome-scale prediction of molecular mechanisms underlying dominant genetic diseases.

Badonyi M, Marsh J. PLoS One. 2024; 19(8):e0307312.

PMID: 39172982 PMC: 11341024. DOI: 10.1371/journal.pone.0307312.


Recurrent carotid paragangliomas in a syndromic patient with a heterozygous missense variant in DNA Methyltransferase 3 Alpha.

German R, Vuocolo B, Vossaert L, Saba L, Fletcher R, Tedder M. Am J Med Genet A. 2024; 197(1):e63849.

PMID: 39166703 PMC: 11637962. DOI: 10.1002/ajmg.a.63849.


Insights into the ANKRD11 variants and short-stature phenotype through literature review and ClinVar database search.

He D, Zhang M, Li Y, Liu F, Ban B. Orphanet J Rare Dis. 2024; 19(1):292.

PMID: 39135054 PMC: 11318275. DOI: 10.1186/s13023-024-03301-y.


Individuals with JAK1 variants are affected by syndromic features encompassing autoimmunity, atopy, colitis, and dermatitis.

Horesh M, Martin-Fernandez M, Gruber C, Buta S, Le Voyer T, Puzenat E. J Exp Med. 2024; 221(6).

PMID: 38563820 PMC: 10986756. DOI: 10.1084/jem.20232387.

