» Articles » PMID: 34390351

Transposable Element Sequence Fragments Incorporated into Coding and Noncoding Transcripts Modulate the Transcriptome of Human Pluripotent Stem Cells

Abstract

Transposable elements (TEs) occupy nearly 40% of mammalian genomes and, whilst most are fragmentary and no longer capable of transposition, they can nevertheless contribute to cell function. TEs within genes transcribed by RNA polymerase II can be copied as parts of primary transcripts; however, their full contribution to mature transcript sequences remains unresolved. Here, using long and short read (LR and SR) RNA sequencing data, we show that 26% of coding and 65% of noncoding transcripts in human pluripotent stem cells (hPSCs) contain TE-derived sequences. Different TE families are incorporated into RNAs in unique patterns, with consequences to transcript structure and function. The presence of TE sequences within a transcript is correlated with TE-type specific changes in its subcellular distribution, alterations in steady-state levels and half-life, and differential association with RNA Binding Proteins (RBPs). We identify hPSC-specific incorporation of endogenous retroviruses (ERVs) and LINE:L1 into protein-coding mRNAs, which generate TE sequence-derived peptides. Finally, single cell RNA-seq reveals that hPSCs express ERV-containing transcripts, whilst differentiating subpopulations lack ERVs and express SINE and LINE-containing transcripts. Overall, our comprehensive analysis demonstrates that the incorporation of TE sequences into the RNAs of hPSCs is more widespread and has a greater impact than previously appreciated.

Citing Articles

Long-read RNA sequencing enables full-length chimeric transcript annotation of transposable elements in lung adenocarcinoma.

Li Y, Liu Y, Xie Y, Wang Y, Wang J, Wang H BMC Cancer. 2025; 25(1):482.

PMID: 40089719 DOI: 10.1186/s12885-025-13888-5.


The nuclear matrix stabilizes primed-specific genes in human pluripotent stem cells.

Ma G, Fu X, Zhou L, Babarinde I, Shi L, Yang W Nat Cell Biol. 2025; 27(2):232-245.

PMID: 39789220 DOI: 10.1038/s41556-024-01595-5.


Functional Bidirectionality of ERV-Derived Long Non-Coding RNAs in Humans.

Song Y, Wen H, Zhai X, Jia L, Li L Int J Mol Sci. 2024; 25(19).

PMID: 39408810 PMC: 11476766. DOI: 10.3390/ijms251910481.


Novel function of U7 snRNA in the repression of HERV1/LTR12s and lincRNAs in human cells.

Plewka P, Szczesniak M, Stepien A, Pasieka R, Wanowska E, Makalowska I Nucleic Acids Res. 2024; 52(17):10504-10519.

PMID: 39189459 PMC: 11417402. DOI: 10.1093/nar/gkae738.


HiTE: a fast and accurate dynamic boundary adjustment approach for full-length transposable element detection and annotation.

Hu K, Ni P, Xu M, Zou Y, Chang J, Gao X Nat Commun. 2024; 15(1):5573.

PMID: 38956036 PMC: 11219922. DOI: 10.1038/s41467-024-49912-8.


References
1.
Cheng L, Zheng D, Baljinnyam E, Sun F, Ogami K, Yeung P . Widespread transcript shortening through alternative polyadenylation in secretory cell differentiation. Nat Commun. 2020; 11(1):3182. PMC: 7311474. DOI: 10.1038/s41467-020-16959-2. View

2.
Mirauta B, Seaton D, Bensaddek D, Brenes A, Bonder M, Kilpinen H . Population-scale proteome variation in human induced pluripotent stem cells. Elife. 2020; 9. PMC: 7447446. DOI: 10.7554/eLife.57390. View

3.
Abascal F, Juan D, Jungreis I, Kellis M, Martinez L, Rigau M . Loose ends: almost one in five human genes still have unresolved coding status. Nucleic Acids Res. 2018; 46(14):7070-7084. PMC: 6101605. DOI: 10.1093/nar/gky587. View

4.
Clayton E, Rishishwar L, Huang T, Gulati S, Ban D, McDonald J . An atlas of transposable element-derived alternative splicing in cancer. Philos Trans R Soc Lond B Biol Sci. 2020; 375(1795):20190342. PMC: 7061986. DOI: 10.1098/rstb.2019.0342. View

5.
Burns K . Transposable elements in cancer. Nat Rev Cancer. 2017; 17(7):415-424. DOI: 10.1038/nrc.2017.35. View