» Articles » PMID: 24369432

High-resolution Functional Annotation of Human Transcriptome: Predicting Isoform Functions by a Novel Multiple Instance-based Label Propagation Method

Overview
Specialty Biochemistry
Date 2013 Dec 27
PMID 24369432
Citations 27
Authors
Affiliations
Soon will be listed here.
Abstract

Alternative transcript processing is an important mechanism for generating functional diversity in genes. However, little is known about the precise functions of individual isoforms. In fact, proteins (translated from transcript isoforms), not genes, are the function carriers. By integrating multiple human RNA-seq data sets, we carried out the first systematic prediction of isoform functions, enabling high-resolution functional annotation of human transcriptome. Unlike gene function prediction, isoform function prediction faces a unique challenge: the lack of the training data--all known functional annotations are at the gene level. To address this challenge, we modelled the gene-isoform relationships as multiple instance data and developed a novel label propagation method to predict functions. Our method achieved an average area under the receiver operating characteristic curve of 0.67 and assigned functions to 15 572 isoforms. Interestingly, we observed that different functions have different sensitivities to alternative isoform processing, and that the function diversity of isoforms from the same gene is positively correlated with their tissue expression diversity. Finally, we surveyed the literature to validate our predictions for a number of apoptotic genes. Strikingly, for the famous 'TP53' gene, we not only accurately identified the apoptosis regulation function of its five isoforms, but also correctly predicted the precise direction of the regulation.

Citing Articles

CrossIsoFun: predicting isoform functions using the integration of multi-omics data.

Liu Y, Li H, Wang J Bioinformatics. 2024; 41(1).

PMID: 39680906 PMC: 11706537. DOI: 10.1093/bioinformatics/btae742.


IsoFrog: a reversible jump Markov Chain Monte Carlo feature selection-based method for predicting isoform functions.

Liu Y, Yang C, Li H, Wang J Bioinformatics. 2023; 39(9).

PMID: 37647643 PMC: 10491952. DOI: 10.1093/bioinformatics/btad530.


An expectation-maximization framework for comprehensive prediction of isoform-specific functions.

Karlebach G, Carmody L, Sundaramurthi J, Casiraghi E, Hansen P, Reese J Bioinformatics. 2023; 39(4).

PMID: 36929917 PMC: 10079350. DOI: 10.1093/bioinformatics/btad132.


A Global Analysis of Alternative Splicing of Medicinal Plants, Ranunculales.

Hao D, Chen H, Xiao P, Jiang T Curr Genomics. 2023; 23(3):207-216.

PMID: 36777007 PMC: 9878827. DOI: 10.2174/1389202923666220527112929.


Evolution of isoform-level gene expression patterns across tissues during lotus species divergence.

Zhang Y, Yang X, Van de Peer Y, Chen J, Marchal K, Shi T Plant J. 2022; 112(3):830-846.

PMID: 36123806 PMC: 7613771. DOI: 10.1111/tpj.15984.


References
1.
Youngs N, Penfold-Brown D, Drew K, Shasha D, Bonneau R . Parametric Bayesian priors and better choice of negative examples improve protein function prediction. Bioinformatics. 2013; 29(9):1190-8. PMC: 3634187. DOI: 10.1093/bioinformatics/btt110. View

2.
Syken J, Munger K . TID1, a human homolog of the Drosophila tumor suppressor l(2)tid, encodes two mitochondrial modulators of apoptosis with opposing functions. Proc Natl Acad Sci U S A. 1999; 96(15):8499-504. PMC: 17545. DOI: 10.1073/pnas.96.15.8499. View

3.
Ellis J, Barrios-Rodiles M, Colak R, Irimia M, Kim T, Calarco J . Tissue-specific alternative splicing remodels protein-protein interaction networks. Mol Cell. 2012; 46(6):884-92. DOI: 10.1016/j.molcel.2012.05.037. View

4.
Li W, Liu C, Zhang T, Li H, Waterman M, Zhou X . Integrative analysis of many weighted co-expression networks using tensor computation. PLoS Comput Biol. 2011; 7(6):e1001106. PMC: 3116899. DOI: 10.1371/journal.pcbi.1001106. View

5.
Warde-Farley D, Donaldson S, Comes O, Zuberi K, Badrawi R, Chao P . The GeneMANIA prediction server: biological network integration for gene prioritization and predicting gene function. Nucleic Acids Res. 2010; 38(Web Server issue):W214-20. PMC: 2896186. DOI: 10.1093/nar/gkq537. View