» Articles » PMID: 35044812

Widespread Occurrence of Hybrid Internal-terminal Exons in Human Transcriptomes

Overview
Journal Sci Adv
Specialties Biology
Science
Date 2022 Jan 19
PMID 35044812
Authors
Affiliations
Soon will be listed here.
Abstract

Messenger RNA isoform differences are predominantly driven by alternative first, internal, and last exons. Despite the importance of classifying exons to understand isoform structure, few tools examine isoform-specific exon usage. We recently observed that alternative transcription start sites often arise near internal exons, often creating “hybrid” first/internal exons. To systematically detect hybrid exons, we built the hybrid-internal-terminal (HIT) pipeline to classify exons depending on their isoform-specific usage. On the basis of splice junction reads in RNA sequencing data and probabilistic modeling, the HIT index identified thousands of previously misclassified hybrid first-internal and internal-last exons. Hybrid exons are enriched in long genes and genes involved in RNA splicing and have longer flanking introns and strong splice sites. Their usage varies considerably across human tissues. By developing the first method to classify exons according to isoform contexts, our findings document the occurrence of hybrid exons, a common quirk of the human transcriptome.

Citing Articles

Hybrid exons evolved by coupling transcription initiation and splicing at the nucleotide level.

Mick S, Carroll C, Uriostegui-Arcos M, Fiszbein A Nucleic Acids Res. 2024; 53(3).

PMID: 39739742 PMC: 11797052. DOI: 10.1093/nar/gkae1251.


Challenges in identifying mRNA transcript starts and ends from long-read sequencing data.

Calvo-Roitberg E, Daniels R, Pai A Genome Res. 2024; 34(11):1719-1734.

PMID: 39567236 PMC: 11610588. DOI: 10.1101/gr.279559.124.


Biosurfer for systematic tracking of regulatory mechanisms leading to protein isoform diversity.

Murali M, Saquing J, Lu S, Gao Z, Jordan B, Wakefield Z bioRxiv. 2024; .

PMID: 38559226 PMC: 10980011. DOI: 10.1101/2024.03.15.585320.


mRNA initiation and termination are spatially coordinated.

Calvo-Roitberg E, Carroll C, Venev S, Kim G, Mick S, Dekker J bioRxiv. 2024; .

PMID: 38260419 PMC: 10802295. DOI: 10.1101/2024.01.05.574404.


TE-TSS: an integrated data resource of human and mouse transposable element (TE)-derived transcription start site (TSS).

Gu X, Wang M, Zhang X Nucleic Acids Res. 2023; 52(D1):D322-D333.

PMID: 37956335 PMC: 10767810. DOI: 10.1093/nar/gkad1048.


References
1.
Vo Ngoc L, Cassidy C, Huang C, Duttke S, Kadonaga J . The human initiator is a distinct and abundant element that is precisely positioned in focused core promoters. Genes Dev. 2017; 31(1):6-11. PMC: 5287114. DOI: 10.1101/gad.293837.116. View

2.
Pai A, Luca F . Environmental influences on RNA processing: Biochemical, molecular and genetic regulators of cellular response. Wiley Interdiscip Rev RNA. 2018; 10(1):e1503. PMC: 6294667. DOI: 10.1002/wrna.1503. View

3.
Hadadian Nejad Yousefi M, Goudarzi M, Motahari S . IMOS: improved Meta-aligner and Minimap2 On Spark. BMC Bioinformatics. 2019; 20(1):51. PMC: 6345043. DOI: 10.1186/s12859-018-2592-5. View

4.
Reimer K, Mimoso C, Adelman K, Neugebauer K . Co-transcriptional splicing regulates 3' end cleavage during mammalian erythropoiesis. Mol Cell. 2021; 81(5):998-1012.e7. PMC: 8038867. DOI: 10.1016/j.molcel.2020.12.018. View

5.
Yeo G, Burge C . Maximum entropy modeling of short sequence motifs with applications to RNA splicing signals. J Comput Biol. 2004; 11(2-3):377-94. DOI: 10.1089/1066527041410418. View