The Proteogenomic Mapping Tool

Overview
Publisher Biomed Central
Specialty Biology
Date 2011 Apr 26
PMID 21513508
Citations 18
Abstract

Background: High-throughput mass spectrometry (MS) proteomics data is increasingly being used to complement traditional structural genome annotation methods. To keep pace with the high speed of experimental data generation and to aid in structural genome annotation, experimentally observed peptides need to be mapped back to their source genome location quickly and exactly. Until now, the tools for this task have been limited to custom scripts that individual research groups wrote to analyze their own data; these scripts are generally not widely available and do not scale well to large eukaryotic genomes.

Results: The Proteogenomic Mapping Tool includes a Java implementation of the Aho-Corasick string-searching algorithm. It takes standardized file types as input and rapidly searches experimentally observed peptides for exact matches against a given genome translated in all six reading frames. The Java implementation allows the application to scale well with larger eukaryotic genomes while providing cross-platform functionality.
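
To make the approach in the Results paragraph concrete, the following is a minimal sketch of the general technique: a peptide set is compiled into an Aho-Corasick automaton, and each of the six translated reading frames is scanned once for exact matches. It is not the tool's actual source code. The class name `SixFrameSearch`, the method names, the tab-separated hit format, and the in-memory string inputs are all assumptions made for illustration; the sketch also assumes uppercase DNA and the standard genetic code.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Illustrative sketch only; class and method names are hypothetical.
class SixFrameSearch {
    // Standard genetic code (NCBI table 1); codon bases are indexed in the order T, C, A, G.
    static final String BASES = "TCAG";
    static final String CODE = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG";

    static class Node {
        Map<Character, Integer> next = new HashMap<>();
        int fail = 0;                          // state to fall back to on a mismatch
        List<String> out = new ArrayList<>();  // peptides that end at this state
    }

    // Translate one reading frame starting at offset 0, 1, or 2; unknown bases give 'X'.
    static String translate(String dna, int offset) {
        StringBuilder sb = new StringBuilder();
        for (int i = offset; i + 3 <= dna.length(); i += 3) {
            int a = BASES.indexOf(dna.charAt(i));
            int b = BASES.indexOf(dna.charAt(i + 1));
            int c = BASES.indexOf(dna.charAt(i + 2));
            sb.append(a < 0 || b < 0 || c < 0 ? 'X' : CODE.charAt(16 * a + 4 * b + c));
        }
        return sb.toString();
    }

    static String reverseComplement(String dna) {
        StringBuilder sb = new StringBuilder();
        for (int i = dna.length() - 1; i >= 0; i--) {
            int k = "ACGT".indexOf(dna.charAt(i));
            sb.append(k < 0 ? 'N' : "TGCA".charAt(k));
        }
        return sb.toString();
    }

    // Build the trie, then set failure links breadth-first and merge outputs along them.
    static List<Node> buildAutomaton(List<String> peptides) {
        List<Node> trie = new ArrayList<>();
        trie.add(new Node());
        for (String p : peptides) {
            int s = 0;
            for (char c : p.toCharArray()) {
                Integer t = trie.get(s).next.get(c);
                if (t == null) {
                    trie.add(new Node());
                    t = trie.size() - 1;
                    trie.get(s).next.put(c, t);
                }
                s = t;
            }
            trie.get(s).out.add(p);
        }
        ArrayDeque<Integer> queue = new ArrayDeque<>(trie.get(0).next.values());
        while (!queue.isEmpty()) {
            int s = queue.poll();
            for (Map.Entry<Character, Integer> e : trie.get(s).next.entrySet()) {
                Node child = trie.get(e.getValue());
                int f = trie.get(s).fail;
                while (f != 0 && !trie.get(f).next.containsKey(e.getKey())) f = trie.get(f).fail;
                child.fail = trie.get(f).next.getOrDefault(e.getKey(), 0);
                child.out.addAll(trie.get(child.fail).out);
                queue.add(e.getValue());
            }
        }
        return trie;
    }

    // Returns "peptide <TAB> frame <TAB> 0-based start on the forward strand" for each exact match.
    static List<String> mapPeptides(String dna, List<String> peptides) {
        List<Node> trie = buildAutomaton(peptides);
        String rc = reverseComplement(dna);
        List<String> hits = new ArrayList<>();
        for (int strand = 0; strand < 2; strand++) {
            for (int offset = 0; offset < 3; offset++) {
                String protein = translate(strand == 0 ? dna : rc, offset);
                int s = 0;
                for (int i = 0; i < protein.length(); i++) {
                    char c = protein.charAt(i);
                    while (s != 0 && !trie.get(s).next.containsKey(c)) s = trie.get(s).fail;
                    s = trie.get(s).next.getOrDefault(c, 0);
                    for (String p : trie.get(s).out) {
                        int begin = offset + 3 * (i - p.length() + 1);  // start on the scanned strand
                        int start = strand == 0 ? begin : dna.length() - (begin + 3 * p.length());
                        hits.add(p + "\t" + (strand == 0 ? "+" : "-") + (offset + 1) + "\t" + start);
                    }
                }
            }
        }
        return hits;
    }

    public static void main(String[] args) {
        mapPeptides("ATGGCCAAGTAA", List.of("MAK", "LGH")).forEach(System.out::println);
    }
}
```

Because the automaton is built once from all peptides, each translated frame is read a single time regardless of how many peptides are searched, which is the property that makes this family of algorithms attractive for large genomes. Peptides never match across a stop codon here, since `*` does not occur in any peptide.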

Conclusions: The Proteogenomic Mapping Tool is a standalone application for mapping peptides back to their source genome. It runs on a number of operating system platforms with standard desktop computer hardware and executes very rapidly for a variety of datasets. Because users can select different genetic codes for different organisms, researchers can easily customize the tool to their own research interests. The tool is recommended for anyone working to structurally annotate genomes using MS-derived proteomics data.
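
The effect of selecting a genetic code can be illustrated with a short sketch. This is an assumption-laden example rather than the tool's own interface: the class `GeneticCodes`, the `translate` method, and the choice of keying tables by NCBI translation table number are hypothetical, and only two tables (1, standard; 2, vertebrate mitochondrial) are included. Which codes the tool itself offers is not stated in the abstract.

```java
import java.util.Map;

// Illustrative sketch only; names are hypothetical and not taken from the tool.
class GeneticCodes {
    static final String BASES = "TCAG";  // codon index = 16*first + 4*second + third

    // Amino acid strings as laid out in the NCBI genetic code tables.
    static final Map<Integer, String> TABLES = Map.of(
        1, "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG",   // standard
        2, "FFLLSSSSYY**CCWWLLLLPPPPHHQQRRRRIIMMTTTTNNKKSS**VVVVAAAADDEEGGGG");  // vertebrate mitochondrial

    // Translate the first reading frame of an uppercase DNA string with the chosen table.
    static String translate(String dna, int tableId) {
        String code = TABLES.get(tableId);
        if (code == null) {
            throw new IllegalArgumentException("unsupported genetic code: " + tableId);
        }
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i + 3 <= dna.length(); i += 3) {
            int a = BASES.indexOf(dna.charAt(i));
            int b = BASES.indexOf(dna.charAt(i + 1));
            int c = BASES.indexOf(dna.charAt(i + 2));
            sb.append(a < 0 || b < 0 || c < 0 ? 'X' : code.charAt(16 * a + 4 * b + c));
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        String dna = "ATGTGAATAAGA";            // codons ATG, TGA, ATA, AGA
        System.out.println(translate(dna, 1));  // prints M*IR
        System.out.println(translate(dna, 2));  // prints MWM*
    }
}
```

The same twelve bases yield different peptides under the two tables (TGA, ATA, and AGA are read differently), so a peptide observed from a mitochondrial protein would not match exactly if the genome were translated with the standard code.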

Citing Articles

An overview of technologies for MS-based proteomics-centric multi-omics.

Rajczewski A, Jagtap P, Griffin T Expert Rev Proteomics. 2022; 19(3):165-181.

PMID: 35466851 PMC: 9613604. DOI: 10.1080/14789450.2022.2070476.


Mapping Microproteins and ncRNA-Encoded Polypeptides in Different Mouse Tissues.

Pan N, Wang Z, Wang B, Wan J, Wan C Front Cell Dev Biol. 2021; 9:687748.

PMID: 34381774 PMC: 8350139. DOI: 10.3389/fcell.2021.687748.


Improved Identification of Small Open Reading Frames Encoded Peptides by Top-Down Proteomic Approaches and De Novo Sequencing.

Wang B, Wang Z, Pan N, Huang J, Wan C Int J Mol Sci. 2021; 22(11).

PMID: 34067398 PMC: 8197016. DOI: 10.3390/ijms22115476.


Peptimapper: proteogenomics workflow for the expert annotation of eukaryotic genomes.

Guillot L, Delage L, Viari A, Vandenbrouck Y, Com E, Ritter A BMC Genomics. 2019; 20(1):56.

PMID: 30654742 PMC: 6337836. DOI: 10.1186/s12864-019-5431-9.


A Fast and Quantitative Method for Post-translational Modification and Variant Enabled Mapping of Peptides to Genomes.

Schlaffner C, Pirklbauer G, Bender A, Steen J, Choudhary J J Vis Exp. 2018; (135).

PMID: 29889196 PMC: 6101353. DOI: 10.3791/57633.

