
TransportTP: a Two-phase Classification Approach for Membrane Transporter Prediction and Characterization

Overview
Publisher Biomed Central
Specialty Biology
Date 2009 Dec 17
PMID 20003433
Citations 24
Abstract

Background: Membrane transporters play crucial roles in living cells. Experimental characterization of transporters is costly and time-consuming. Current computational methods for transporter characterization still require extensive curation efforts, especially for eukaryotic organisms. We developed a novel genome-scale transporter prediction and characterization system called TransportTP that combined homology-based and machine learning methods in a two-phase classification approach. First, traditional homology methods were employed to predict novel transporters based on sequence similarity to known classified proteins in the Transporter Classification Database (TCDB). Second, machine learning methods were used to integrate a variety of features to refine the initial predictions. A set of rules based on transporter features was developed by machine learning using well-curated proteomes as guides.
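The two-phase idea described above can be illustrated with a minimal Python sketch. It is not the TransportTP implementation: the `Candidate` fields, the E-value thresholds, the transmembrane-segment rule, and the example proteins are all hypothetical stand-ins. Phase 1 mimics a homology filter against TCDB hits, and phase 2 mimics a refinement step with a single hand-written rule in place of the rules the authors learned from curated proteomes.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    protein_id: str
    best_tc_family: str  # TC family of the best TCDB hit (illustrative value)
    e_value: float       # E-value of that hit (assumed to come from a homology search)
    tm_segments: int     # predicted transmembrane segments (hypothetical feature)


def phase1_homology(candidates, max_e_value=1e-3):
    """Phase 1: keep proteins with a sufficiently similar TCDB hit.

    The threshold is an assumption chosen for illustration only.
    """
    return [c for c in candidates if c.e_value <= max_e_value]


def phase2_refine(candidates, strong_e_value=1e-20, min_tm_segments=2):
    """Phase 2: refine phase-1 hits with additional features.

    This single rule is a stand-in for rules learned by machine learning:
    accept a very strong hit, or a weaker hit that also looks like a
    membrane protein.
    """
    return [
        c for c in candidates
        if c.e_value <= strong_e_value or c.tm_segments >= min_tm_segments
    ]


def predict_transporters(candidates):
    """Run both phases in sequence."""
    return phase2_refine(phase1_homology(candidates))


# Made-up proteome for demonstration.
proteome = [
    Candidate("P1", "2.A.1", 1e-50, 12),  # strong hit, many TM segments
    Candidate("P2", "1.A.1", 1e-5, 0),    # weak hit, no TM segments
    Candidate("P3", "3.A.1", 0.5, 6),     # no significant hit
    Candidate("P4", "2.A.3", 1e-4, 10),   # weak hit, many TM segments
]

predicted = [c.protein_id for c in predict_transporters(proteome)]
print(predicted)
```

In this toy example, P3 is dropped in phase 1 for lack of a significant hit, and P2 is dropped in phase 2 because its weak hit is not supported by the transmembrane feature.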

Results: In a cross-validation using the yeast proteome for training and the proteomes of ten other organisms for testing, TransportTP achieved an equivalent recall and precision of 81.8%, based on TransportDB, a manually annotated transporter database. In an independent test using the Arabidopsis proteome for training and four recently sequenced plant proteomes for testing, it achieved a recall of 74.6% and a precision of 73.4%, according to our manual curation.
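For readers less familiar with the two metrics reported above, the following Python sketch shows how recall and precision are commonly computed from a set of predicted transporters and a curated reference set. The protein identifiers are invented for illustration and are unrelated to the paper's data.

```python
def precision_recall(predicted, curated):
    """Return (precision, recall) for predicted vs. curated transporter sets."""
    predicted, curated = set(predicted), set(curated)
    true_positives = len(predicted & curated)
    # Precision: fraction of predictions that are confirmed by curation.
    precision = true_positives / len(predicted) if predicted else 0.0
    # Recall: fraction of curated transporters that were predicted.
    recall = true_positives / len(curated) if curated else 0.0
    return precision, recall


# Hypothetical example: 4 predictions, 5 curated transporters, 3 in common.
precision, recall = precision_recall(
    {"A", "B", "C", "D"},
    {"B", "C", "D", "E", "F"},
)
print(f"precision={precision:.2f} recall={recall:.2f}")
```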

Conclusions: TransportTP is the most effective tool for eukaryotic transporter characterization to date.

