» Articles » PMID: 24256031

Detecting Small Plant Peptides Using SPADA (Small Peptide Alignment Discovery Application)

Overview
Publisher Biomed Central
Specialty Biology
Date 2013 Nov 22
PMID 24256031
Citations 51
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Small peptides encoded as one- or two-exon genes in plants have recently been shown to affect multiple aspects of plant development, reproduction and defense responses. However, popular similarity search tools and gene prediction techniques generally fail to identify most members belonging to this class of genes. This is largely due to the high sequence divergence among family members and the limited availability of experimentally verified small peptides to use as training sets for homology search and ab initio prediction. Consequently, there is an urgent need for both experimental and computational studies in order to further advance the accurate prediction of small peptides.

Results: We present here a homology-based gene prediction program to accurately predict small peptides at the genome level. Given a high-quality profile alignment, SPADA identifies and annotates nearly all family members in tested genomes with better performance than all general-purpose gene prediction programs surveyed. We find numerous mis-annotations in the current Arabidopsis thaliana and Medicago truncatula genome databases using SPADA, most of which have RNA-Seq expression support. We also show that SPADA works well on other classes of small secreted peptides in plants (e.g., self-incompatibility protein homologues) as well as non-secreted peptides outside the plant kingdom (e.g., the alpha-amanitin toxin gene family in the mushroom, Amanita bisporigera).

Conclusions: SPADA is a free software tool that accurately identifies and predicts the gene structure for short peptides with one or two exons. SPADA is able to incorporate information from profile alignments into the model prediction process and makes use of it to score different candidate models. SPADA achieves high sensitivity and specificity in predicting small plant peptides such as the cysteine-rich peptide families. A systematic application of SPADA to other classes of small peptides by research communities will greatly improve the genome annotation of different protein families in public genome databases.

Citing Articles

Peptide hormones in plants.

Zhang Z, Han H, Zhao J, Liu Z, Deng L, Wu L Mol Hortic. 2025; 5(1):7.

PMID: 39849641 PMC: 11756074. DOI: 10.1186/s43897-024-00134-y.


sOCP: a framework predicting smORF coding potential based on TIS and in-frame features and effectively applied in the human genome.

Peng Z, Li J, Jiang X, Wan C Brief Bioinform. 2024; 25(3).

PMID: 38600664 PMC: 11006793. DOI: 10.1093/bib/bbae147.


Exploring the role of symbiotic modifier peptidases in the legume - rhizobium symbiosis.

Ghosh P, Chakraborty J Arch Microbiol. 2024; 206(4):147.

PMID: 38462552 DOI: 10.1007/s00203-024-03920-w.


Improved super-resolution ribosome profiling reveals prevalent translation of upstream ORFs and small ORFs in Arabidopsis.

Wu H, Ai Q, Teixeira R, Nguyen P, Song G, Montes C Plant Cell. 2023; 36(3):510-539.

PMID: 38000896 PMC: 10896292. DOI: 10.1093/plcell/koad290.


Peptidomics Methods Applied to the Study of Flower Development.

Alvarez-Urdiola R, Borras E, Valverde F, Matus J, Sabido E, Riechmann J Methods Mol Biol. 2023; 2686:509-536.

PMID: 37540375 DOI: 10.1007/978-1-0716-3299-4_24.


References
1.
Henikoff S, Henikoff J . Amino acid substitution matrices from protein blocks. Proc Natl Acad Sci U S A. 1992; 89(22):10915-9. PMC: 50453. DOI: 10.1073/pnas.89.22.10915. View

2.
Sievers F, Wilm A, Dineen D, Gibson T, Karplus K, Li W . Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Mol Syst Biol. 2011; 7:539. PMC: 3261699. DOI: 10.1038/msb.2011.75. View

3.
Wang D, Griffitts J, Starker C, Fedorova E, Limpens E, Ivanov S . A nodule-specific protein secretory pathway required for nitrogen-fixing symbiosis. Science. 2010; 327(5969):1126-9. PMC: 4824053. DOI: 10.1126/science.1184096. View

4.
Pan B, Sheng J, Sun W, Zhao Y, Hao P, Li X . OrysPSSP: a comparative platform for small secreted proteins from rice and other plants. Nucleic Acids Res. 2012; 41(Database issue):D1192-8. PMC: 3531210. DOI: 10.1093/nar/gks1090. View

5.
Burset M, Guigo R . Evaluation of gene structure prediction programs. Genomics. 1996; 34(3):353-67. DOI: 10.1006/geno.1996.0298. View