Application of a Bioinformatic Pipeline to RNA-seq Data Identifies Novel Virus-like Sequence in Human Blood
Overview
Molecular Biology
Authors
Affiliations
Numerous reports have suggested that infectious agents could play a role in neurodegenerative diseases, but specific etiological agents have not been convincingly demonstrated. To search for candidate agents in an unbiased fashion, we have developed a bioinformatic pipeline that identifies microbial sequences in mammalian RNA-seq data, including sequences with no significant nucleotide similarity hits in GenBank. Effectiveness of the pipeline was tested using publicly available RNA-seq data and in a reconstruction experiment using synthetic data. We then applied this pipeline to a novel RNA-seq dataset generated from a cohort of 120 samples from amyotrophic lateral sclerosis patients and controls, and identified sequences corresponding to known bacteria and viruses, as well as novel virus-like sequences. The presence of these novel virus-like sequences, which were identified in subsets of both patients and controls, were confirmed by quantitative RT-PCR. We believe this pipeline will be a useful tool for the identification of potential etiological agents in the many RNA-seq datasets currently being generated.
El-Sayed A, Shindia A, Emam E, Labib M, El-Deen E, Seadawy M Sci Rep. 2024; 14(1):27715.
PMID: 39532921 PMC: 11557573. DOI: 10.1038/s41598-024-78368-5.
Shin D, Kim J, Lee J, Kim J, Oh Y Int J Chron Obstruct Pulmon Dis. 2023; 18:2531-2542.
PMID: 38022823 PMC: 10644840. DOI: 10.2147/COPD.S426260.
Grima N, Liu S, Southwood D, Henden L, Smith A, Lee A Neuropathol Appl Neurobiol. 2023; 49(6):e12943.
PMID: 37818590 PMC: 10946588. DOI: 10.1111/nan.12943.
Link C Neurosci Insights. 2021; 16:26331055211018709.
PMID: 34104888 PMC: 8165828. DOI: 10.1177/26331055211018709.