Confounding Factors in Profiling of Locus-specific Human Endogenous Retrovirus (HERV) Transcript Signatures in Primary T Cells Using Multi-study-derived Datasets

Overview
Publisher Biomed Central
Specialty Genetics
Date 2023 Apr 4
PMID 37013607
Abstract

Background: Human endogenous retroviruses (HERVs) are repetitive sequence elements that make up a substantial part of the human genome. Their role in development is well documented, and there is mounting evidence that dysregulated HERV expression also contributes to various human diseases. Research on HERV elements was long hampered by their high sequence similarity, but advanced sequencing technology and analytical tools have empowered the field. For the first time, locus-specific HERV analysis is now possible, allowing us to decipher the expression patterns, regulatory networks and biological functions of these elements. To do so, we inevitably rely on omics datasets available in the public domain. However, technical parameters differ between studies, making inter-study analysis challenging. Here we address the issue of confounding factors in profiling locus-specific HERV transcriptomes using datasets from multiple sources.

Methods: We collected RNAseq datasets of CD4 and CD8 primary T cells and extracted HERV expression profiles for 3220 elements that most closely resemble intact, near full-length proviruses. Examining sequencing parameters and batch effects, we compared HERV signatures across datasets and determined the features that permit HERV expression analysis from multiple-source data.

Results: We demonstrate that, among sequencing parameters, sequencing depth has the greatest influence on the resulting HERV signature. Sequencing samples more deeply broadens the spectrum of expressed HERV elements that are detected. Sequencing mode and read length are secondary parameters. Nevertheless, we find that HERV signatures from smaller RNAseq datasets reliably reveal the most abundantly expressed HERV elements. Overall, HERV signatures overlap substantially between samples and studies, indicating a robust HERV transcript signature in CD4 and CD8 T cells. Moreover, we find that batch effect reduction measures are critical for uncovering genic and HERV expression differences between cell types. Once they were applied, differences in the HERV transcriptome between the ontologically closely related CD4 and CD8 T cells became apparent.
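The dependence of the detected HERV signature on sequencing depth can be illustrated with a minimal sketch. This is not the authors' pipeline: the locus names, abundances, the 10-read detection threshold and the function names are all hypothetical, and expected read counts are computed deterministically rather than sampled.

```python
from typing import Dict, Set


def expected_reads(abundance_cpm: float, depth: int) -> float:
    """Expected reads for a locus with the given abundance (counts per million)."""
    return abundance_cpm * depth / 1_000_000


def detected_loci(abundance_cpm: Dict[str, float], depth: int,
                  min_reads: float = 10) -> Set[str]:
    """Loci whose expected read count reaches the (assumed) detection threshold."""
    return {locus for locus, cpm in abundance_cpm.items()
            if expected_reads(cpm, depth) >= min_reads}


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Overlap of two signatures: shared loci divided by all loci in either set."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


# Hypothetical true abundances of three HERV loci in one cell type.
abundance = {"HERV_locus_A": 5.0, "HERV_locus_B": 0.3, "HERV_locus_C": 0.12}

shallow = detected_loci(abundance, depth=20_000_000)   # only the abundant locus
deep = detected_loci(abundance, depth=100_000_000)     # all three loci
overlap = jaccard(shallow, deep)

print(sorted(shallow), sorted(deep), round(overlap, 3))
```

In this toy setting the shallow library still recovers the most abundant locus, while the deeper library adds the weakly expressed ones, which mirrors the qualitative pattern described in the Results.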

Conclusion: In our systematic approach to determining sequencing and analysis parameters for the detection of locus-specific HERV expression, we provide evidence that analysing RNAseq datasets from multiple studies can increase confidence in biological findings. When generating HERV expression datasets de novo, we recommend a greater sequencing depth (≥ 100 million reads) than is used in standard genic transcriptome pipelines. Finally, batch effect reduction measures need to be implemented to allow for differential expression analysis.
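As a rough illustration of why batch effect reduction matters before comparing cell types, the sketch below removes a per-study offset by mean-centering log expression values within each study. This is a deliberately simple stand-in, not the method used in the study; the function name, study labels and values are hypothetical, and the approach assumes a balanced design in which every study contains both cell types.

```python
from collections import defaultdict
from statistics import mean
from typing import List


def center_by_batch(values: List[float], batches: List[str]) -> List[float]:
    """Subtract each batch's mean from its samples, then add back the grand mean."""
    grand_mean = mean(values)
    grouped = defaultdict(list)
    for value, batch in zip(values, batches):
        grouped[batch].append(value)
    batch_mean = {batch: mean(vals) for batch, vals in grouped.items()}
    return [value - batch_mean[batch] + grand_mean
            for value, batch in zip(values, batches)]


# Hypothetical log2 expression of one HERV locus in four samples.
# study_2 carries a technical offset of +2 relative to study_1.
log_expr = [1.0, 3.0, 3.0, 5.0]
batches = ["study_1", "study_1", "study_2", "study_2"]
cell_types = ["CD4", "CD8", "CD4", "CD8"]

corrected = center_by_batch(log_expr, batches)
for cell_type, before, after in zip(cell_types, log_expr, corrected):
    print(cell_type, before, after)
```

Before correction, the CD8 sample from study_1 and the CD4 sample from study_2 have the same value, so the study offset masks the cell-type difference; after centering, both CD4 samples and both CD8 samples agree across studies.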

Citing Articles

Targeted Variant Assessments of Human Endogenous Retroviral Regions in Whole Genome Sequencing Data Reveal Retroviral Variants Associated with Papillary Thyroid Cancer.

Stricker E, Peckham-Gregory E, Lai S, Sandulache V, Scheurer M Microorganisms. 2025; 12(12.

PMID: 39770638 PMC: 11679660. DOI: 10.3390/microorganisms12122435.


Cell-Specific Transposable Element and Gene Expression Analysis Across Systemic Lupus Erythematosus Phenotypes.

Cutts Z, Patterson S, Maliskova L, Taylor K, Ye C, Dall'Era M ACR Open Rheumatol. 2024; 6(11):769-779.

PMID: 39143499 PMC: 11557995. DOI: 10.1002/acr2.11713.


CancerHERVdb: Human Endogenous Retrovirus (HERV) Expression Database for Human Cancer Accelerates Studies of the Retrovirome and Predictions for HERV-Based Therapies.

Stricker E, Peckham-Gregory E, Scheurer M J Virol. 2023; 97(6):e0005923.

PMID: 37255431 PMC: 10308937. DOI: 10.1128/jvi.00059-23.
