PPVED: A Machine Learning Tool for Predicting the Effect of Single Amino Acid Substitution on Protein Function in Plants

Overview
Specialties: Biology, Biotechnology
Date: 2022 Apr 10
PMID: 35398963
Abstract

Single amino acid substitution (SAAS) is the most common type of variation that alters protein function under physiological conditions. As the number of SAAS events reported in plants has grown exponentially, an effective prediction tool is needed to identify functional SAASs across the whole genome and to distinguish potentially causal substitutions from other variants. Here, we constructed a plant SAAS database that stores 12 865 SAASs in 6172 proteins and developed a tool called Plant Protein Variation Effect Detector (PPVED) that predicts the effect of SAASs on protein function in plants. PPVED achieved 87% predictive accuracy when applied to plant SAASs, much higher than that of six predictors developed from human databases: SIFT, PROVEAN, PANTHER-PSEP, PhD-SNP, PolyPhen-2, and MutPred2. The predicted effects of six SAASs in three proteins from Arabidopsis and maize were tested in wet lab experiments, and five of the six substitution sites were predicted correctly. PPVED could facilitate the identification and characterization of genetic variants that explain observed phenotype variations in plants, contributing to solutions for challenges in functional genomics and systems biology. PPVED can be accessed under a CC-BY (4.0) license via http://www.ppved.org.cn.
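
The wet lab validation reported in the abstract is simple arithmetic: five of six substitution sites matched the experimental outcome. The following is a minimal Python sketch of how a predictive accuracy of this kind can be computed. The `accuracy` helper and the per-site labels are hypothetical placeholders; only the totals (six sites, five correct) come from the abstract, and this is not PPVED's own code.

```python
from fractions import Fraction


def accuracy(predicted, observed):
    """Return the fraction of sites whose predicted effect matches the observed one."""
    if len(predicted) != len(observed):
        raise ValueError("predicted and observed must have the same length")
    if not predicted:
        raise ValueError("at least one site is required")
    correct = sum(p == o for p, o in zip(predicted, observed))
    return Fraction(correct, len(predicted))


# Hypothetical per-site labels: the abstract gives only the totals
# (six SAASs tested, five predicted correctly), not which site was missed.
observed = ["effect", "effect", "neutral", "effect", "neutral", "effect"]
predicted = ["effect", "effect", "neutral", "effect", "neutral", "neutral"]

wet_lab_accuracy = accuracy(predicted, observed)
print(f"{float(wet_lab_accuracy):.1%}")  # 83.3%
```

With only six sites, a single misprediction shifts the result by about 17 percentage points, so this figure is best read alongside the 87% accuracy reported for the larger plant SAAS data set rather than as an independent estimate.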

Citing Articles

Analysis of the genetic basis of fiber-related traits and flowering time in upland cotton using machine learning.

Li W, Zhang M, Fan J, Yang Z, Peng J, Zhang J. Theor Appl Genet. 2025; 138(1):36.

PMID: 39853381. DOI: 10.1007/s00122-025-04821-2.


A chromosome-level genome assembly of , a tomato wild relative associated with resistance to salinity and drought.

Molitor C, Kurowski T, Fidalgo de Almeida P, Kevei Z, Spindlow D, Chacko Kaitholil S. Front Plant Sci. 2024; 15:1342739.

PMID: 38525148. PMC: 10957597. DOI: 10.3389/fpls.2024.1342739.


Identification and haplotype analysis of SiCHLI: a gene for yellow-green seedling as morphological marker to accelerate foxtail millet (Setaria italica) hybrid breeding.

Liang H, He Q, Zhang H, Zhi H, Tang S, Wang H. Theor Appl Genet. 2023; 136(1):24.

PMID: 36739566. DOI: 10.1007/s00122-023-04309-x.


A genome-wide association study of folates in sweet corn kernels.

Xiao Y, Yu Y, Xie L, Li K, Guo X, Li G. Front Plant Sci. 2022; 13:1004455.

PMID: 36247547. PMC: 9562826. DOI: 10.3389/fpls.2022.1004455.


ABA-inducible DEEPER ROOTING 1 improves adaptation of maize to water deficiency.

Feng X, Jia L, Cai Y, Guan H, Zheng D, Zhang W. Plant Biotechnol J. 2022; 20(11):2077-2088.

PMID: 35796628. PMC: 9616520. DOI: 10.1111/pbi.13889.

