Widespread Occurrence of Hybrid Internal-terminal Exons in Human Transcriptomes
Authors
Affiliations
Messenger RNA isoform differences are predominantly driven by alternative first, internal, and last exons. Despite the importance of classifying exons to understand isoform structure, few tools examine isoform-specific exon usage. We recently observed that alternative transcription start sites often arise near internal exons, often creating “hybrid” first/internal exons. To systematically detect hybrid exons, we built the hybrid-internal-terminal (HIT) pipeline to classify exons depending on their isoform-specific usage. On the basis of splice junction reads in RNA sequencing data and probabilistic modeling, the HIT index identified thousands of previously misclassified hybrid first-internal and internal-last exons. Hybrid exons are enriched in long genes and genes involved in RNA splicing and have longer flanking introns and strong splice sites. Their usage varies considerably across human tissues. By developing the first method to classify exons according to isoform contexts, our findings document the occurrence of hybrid exons, a common quirk of the human transcriptome.
Hybrid exons evolved by coupling transcription initiation and splicing at the nucleotide level.
Mick S, Carroll C, Uriostegui-Arcos M, Fiszbein A Nucleic Acids Res. 2024; 53(3).
PMID: 39739742 PMC: 11797052. DOI: 10.1093/nar/gkae1251.
Challenges in identifying mRNA transcript starts and ends from long-read sequencing data.
Calvo-Roitberg E, Daniels R, Pai A Genome Res. 2024; 34(11):1719-1734.
PMID: 39567236 PMC: 11610588. DOI: 10.1101/gr.279559.124.
Biosurfer for systematic tracking of regulatory mechanisms leading to protein isoform diversity.
Murali M, Saquing J, Lu S, Gao Z, Jordan B, Wakefield Z bioRxiv. 2024; .
PMID: 38559226 PMC: 10980011. DOI: 10.1101/2024.03.15.585320.
mRNA initiation and termination are spatially coordinated.
Calvo-Roitberg E, Carroll C, Venev S, Kim G, Mick S, Dekker J bioRxiv. 2024; .
PMID: 38260419 PMC: 10802295. DOI: 10.1101/2024.01.05.574404.
Gu X, Wang M, Zhang X Nucleic Acids Res. 2023; 52(D1):D322-D333.
PMID: 37956335 PMC: 10767810. DOI: 10.1093/nar/gkad1048.