
Comprehensive Assessment of mRNA Isoform Detection Methods for Long-read Sequencing Data

Overview
Journal Nat Commun
Specialty Biology
Date 2024 May 10
PMID 38730241
Abstract

Advances in long-read sequencing (LRS) have extended read lengths to several kilobases, facilitating the identification of alternative splicing events and isoform expression. Numerous computational tools for isoform detection from long-read sequencing data have recently been developed. However, comparative studies that systematically evaluate these tools, which implement different algorithms, under simulations covering the factors that may influence their performance remain lacking. In this study, we benchmarked thirteen methods implemented in nine tools capable of identifying isoform structures from long-read RNA-seq data. We evaluated their performance on simulated data representing diverse sequencing platforms, generated by an in-house simulator; on RNA sequins (sequencing spike-ins) data; and on experimental data. Our findings identify IsoQuant as a highly effective tool for isoform detection with LRS, with Bambu and StringTie2 also performing strongly. These results offer guidance for future research on alternative splicing analysis and for the ongoing improvement of isoform detection tools for LRS data.

Citing Articles

Understanding isoform expression by pairing long-read sequencing with single-cell and spatial transcriptomics.

Belchikov N, Hsu J, Li X, Jarroux J, Hu W, Joglekar A Genome Res. 2024; 34(11):1735-1746.

PMID: 39567235 PMC: 11610585. DOI: 10.1101/gr.279640.124.


Long-read RNA sequencing: A transformative technology for exploring transcriptome complexity in human diseases.

Ament I, DeBruyne N, Wang F, Lin L Mol Ther. 2024; 33(3):883-894.

PMID: 39563027 PMC: 11897757. DOI: 10.1016/j.ymthe.2024.11.025.


Discovering the hidden function in fungal genomes.

Gervais N, Shapiro R Nat Commun. 2024; 15(1):8219.

PMID: 39300175 PMC: 11413187. DOI: 10.1038/s41467-024-52568-z.


Comprehensive assessment of mRNA isoform detection methods for long-read sequencing data.

Su Y, Yu Z, Jin S, Ai Z, Yuan R, Chen X Nat Commun. 2024; 15(1):3972.

PMID: 38730241 PMC: 11087464. DOI: 10.1038/s41467-024-48117-3.
