Comparison and Integration of Deleteriousness Prediction Methods for Nonsynonymous SNVs in Whole Exome Sequencing Studies

Overview
Journal Hum Mol Genet
Date 2015 Jan 2
PMID 25552646
Citations 593
Abstract

Accurate deleteriousness prediction for nonsynonymous variants is crucial for distinguishing pathogenic mutations from background polymorphisms in whole exome sequencing (WES) studies. Although many deleteriousness prediction methods have been developed, their prediction results are sometimes inconsistent with each other and their relative merits are still unclear in practical applications. To address these issues, we comprehensively evaluated the predictive performance of 18 current deleteriousness-scoring methods, including 11 function prediction scores (PolyPhen-2, SIFT, MutationTaster, Mutation Assessor, FATHMM, LRT, PANTHER, PhD-SNP, SNAP, SNPs&GO and MutPred), 3 conservation scores (GERP++, SiPhy and PhyloP) and 4 ensemble scores (CADD, PON-P, KGGSeq and CONDEL). We found that FATHMM and KGGSeq had the highest discriminative power among independent scores and ensemble scores, respectively. Moreover, to ensure unbiased performance evaluation of these prediction scores, we manually collected three distinct testing datasets, on which no current prediction scores were tuned. In addition, we developed two new ensemble scores that integrate nine independent scores and allele frequency. Our scores achieved the highest discriminative power compared with all the deleteriousness prediction scores tested and showed a low false-positive prediction rate for benign yet rare nonsynonymous variants, which demonstrated the value of combining information from multiple orthogonal approaches. Finally, to facilitate variant prioritization in WES studies, we have pre-computed our ensemble scores for 87 347 044 possible variants in the whole exome and made them publicly available through the ANNOVAR software and the dbNSFP database.
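
As a rough illustration of what "discriminative power" and "integrating independent scores and allele frequency" can mean in practice, the sketch below computes a ROC AUC for each score and for a naive ensemble. It is not the authors' method: the abstract does not describe how the ensemble scores were built, so the sketch swaps in a plain average of min-max-normalized columns. The column names (`method_a`, `method_b`, `one_minus_af`), the toy values, and the helper functions are all hypothetical.

```python
from typing import Dict, List, Sequence


def auc(scores: Sequence[float], labels: Sequence[int]) -> float:
    """ROC AUC: probability that a deleterious variant (label 1)
    outscores a benign one (label 0); ties count as one half."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one variant of each class")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def min_max(values: Sequence[float]) -> List[float]:
    """Rescale one score column to [0, 1] so columns are comparable."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.5] * len(values)  # constant column carries no signal
    return [(v - lo) / (hi - lo) for v in values]


def ensemble(score_table: Dict[str, Sequence[float]]) -> List[float]:
    """Naive ensemble (an assumption, not the paper's model):
    the per-variant mean of the normalized score columns."""
    normalized = [min_max(col) for col in score_table.values()]
    return [sum(row) / len(row) for row in zip(*normalized)]


# Hypothetical toy data: three deleterious variants, then three benign ones.
labels = [1, 1, 1, 0, 0, 0]
score_table = {
    "method_a": [0.9, 0.4, 0.8, 0.5, 0.2, 0.1],
    "method_b": [3.0, 5.0, 1.0, 2.0, 0.5, 1.5],
    # 1 - allele frequency, so that rarer variants score higher;
    # the fifth variant is benign yet rare.
    "one_minus_af": [0.999, 0.998, 0.9995, 0.70, 0.9999, 0.60],
}

if __name__ == "__main__":
    for name, column in score_table.items():
        print(f"{name}: AUC = {auc(column, labels):.3f}")
    print(f"ensemble: AUC = {auc(ensemble(score_table), labels):.3f}")
```

Each toy column misranks at least one variant, while the averaged score separates the two classes, which is the intuition behind combining complementary signals. A trained model such as a regression or another classifier would normally replace the plain average, and it would need to be evaluated on data that none of the component scores were tuned on.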

Citing Articles

PON-P3: Accurate Prediction of Pathogenicity of Amino Acid Substitutions.

Kabir M, Ahmed S, Zhang H, Rodriguez-Rodriguez I, Najibi S, Vihinen M Int J Mol Sci. 2025; 26(5).

PMID: 40076632 PMC: 11899954. DOI: 10.3390/ijms26052004.


XGBMUT: Predicting the Functional Impact of Missense Mutations Using an Extreme Gradient Boost Classifier.

Pereira G, Da Conceicao L, Abrahim-Vieira B, Rodrigues C, Cabral L, Coelho R ACS Omega. 2025; 10(8):8349-8360.

PMID: 40060867 PMC: 11886911. DOI: 10.1021/acsomega.4c10179.


Meta-analysis reveals transcription factors and DNA binding domain variants associated with congenital heart defect and orofacial cleft.

Jeong R, Bulyk M medRxiv. 2025; .

PMID: 39974057 PMC: 11838631. DOI: 10.1101/2025.01.30.25321274.


Integrative analysis of KCNQ1 variants reveals molecular mechanisms of type 1 long QT syndrome pathogenesis.

Brewer K, Vanoye C, Huang H, Clowes Moster K, Desai R, Hayes J Proc Natl Acad Sci U S A. 2025; 122(8):e2412971122.

PMID: 39969993 PMC: 11873829. DOI: 10.1073/pnas.2412971122.


Identifying somatic driver mutations in cancer with a language model of the human genome.

Zeng G, Zhao C, Li G, Huang Z, Zhuang J, Liang X Comput Struct Biotechnol J. 2025; 27:531-540.

PMID: 39968174 PMC: 11833646. DOI: 10.1016/j.csbj.2025.01.011.

