High-resolution Functional Annotation of Human Transcriptome: Predicting Isoform Functions by a Novel Multiple Instance-based Label Propagation Method
Overview
Affiliations
Alternative transcript processing is an important mechanism for generating functional diversity in genes. However, little is known about the precise functions of individual isoforms. In fact, proteins (translated from transcript isoforms), not genes, are the function carriers. By integrating multiple human RNA-seq data sets, we carried out the first systematic prediction of isoform functions, enabling high-resolution functional annotation of human transcriptome. Unlike gene function prediction, isoform function prediction faces a unique challenge: the lack of the training data--all known functional annotations are at the gene level. To address this challenge, we modelled the gene-isoform relationships as multiple instance data and developed a novel label propagation method to predict functions. Our method achieved an average area under the receiver operating characteristic curve of 0.67 and assigned functions to 15 572 isoforms. Interestingly, we observed that different functions have different sensitivities to alternative isoform processing, and that the function diversity of isoforms from the same gene is positively correlated with their tissue expression diversity. Finally, we surveyed the literature to validate our predictions for a number of apoptotic genes. Strikingly, for the famous 'TP53' gene, we not only accurately identified the apoptosis regulation function of its five isoforms, but also correctly predicted the precise direction of the regulation.
CrossIsoFun: predicting isoform functions using the integration of multi-omics data.
Liu Y, Li H, Wang J Bioinformatics. 2024; 41(1).
PMID: 39680906 PMC: 11706537. DOI: 10.1093/bioinformatics/btae742.
Liu Y, Yang C, Li H, Wang J Bioinformatics. 2023; 39(9).
PMID: 37647643 PMC: 10491952. DOI: 10.1093/bioinformatics/btad530.
An expectation-maximization framework for comprehensive prediction of isoform-specific functions.
Karlebach G, Carmody L, Sundaramurthi J, Casiraghi E, Hansen P, Reese J Bioinformatics. 2023; 39(4).
PMID: 36929917 PMC: 10079350. DOI: 10.1093/bioinformatics/btad132.
A Global Analysis of Alternative Splicing of Medicinal Plants, Ranunculales.
Hao D, Chen H, Xiao P, Jiang T Curr Genomics. 2023; 23(3):207-216.
PMID: 36777007 PMC: 9878827. DOI: 10.2174/1389202923666220527112929.
Evolution of isoform-level gene expression patterns across tissues during lotus species divergence.
Zhang Y, Yang X, Van de Peer Y, Chen J, Marchal K, Shi T Plant J. 2022; 112(3):830-846.
PMID: 36123806 PMC: 7613771. DOI: 10.1111/tpj.15984.