» Articles » PMID: 39149898

Improved Sub-genomic RNA Prediction with the ARTIC Protocol

Overview
Specialty Biochemistry
Date 2024 Aug 16
PMID 39149898
Authors
Affiliations
Soon will be listed here.
Abstract

Viral subgenomic RNA (sgRNA) plays a major role in SARS-COV2's replication, pathogenicity, and evolution. Recent sequencing protocols, such as the ARTIC protocol, have been established. However, due to the viral-specific biological processes, analyzing sgRNA through viral-specific read sequencing data is a computational challenge. Current methods rely on computational tools designed for eukaryote genomes, resulting in a gap in the tools designed specifically for sgRNA detection. To address this, we make two contributions. Firstly, we present sgENERATE, an evaluation pipeline to study the accuracy and efficacy of sgRNA detection tools using the popular ARTIC sequencing protocol. Using sgENERATE, we evaluate periscope, a recently introduced tool that detects sgRNA from ARTIC sequencing data. We find that periscope has biased predictions and high computational costs. Secondly, using the information produced from sgENERATE, we redesign the algorithm in periscope to use multiple references from canonical sgRNAs to mitigate alignment issues and improve sgRNA and non-canonical sgRNA detection. We evaluate periscope and our algorithm, periscope_multi, on simulated and biological sequencing datasets and demonstrate periscope_multi's enhanced sgRNA detection accuracy. Our contribution advances tools for studying viral sgRNA, paving the way for more accurate and efficient analyses in the context of viral RNA discovery.

References
1.
Rakocevic G, Semenyuk V, Lee W, Spencer J, Browning J, Johnson I . Fast and accurate genomic analyses using genome graphs. Nat Genet. 2019; 51(2):354-362. DOI: 10.1038/s41588-018-0316-4. View

2.
Kim K, Park K, Lee S, Baek S, Lim T, Kim J . VirPipe: an easy-to-use and customizable pipeline for detecting viral genomes from Nanopore sequencing. Bioinformatics. 2023; 39(5). PMC: 10191607. DOI: 10.1093/bioinformatics/btad293. View

3.
Cock P, Antao T, Chang J, Chapman B, Cox C, Dalke A . Biopython: freely available Python tools for computational molecular biology and bioinformatics. Bioinformatics. 2009; 25(11):1422-3. PMC: 2682512. DOI: 10.1093/bioinformatics/btp163. View

4.
Di Gioacchino A, Legendre R, Rahou Y, Najburg V, Charneau P, Greenbaum B . sgDI-tector: defective interfering viral genome bioinformatics for detection of coronavirus subgenomic RNAs. RNA. 2021; 28(3):277-289. PMC: 8848934. DOI: 10.1261/rna.078969.121. View

5.
Lamers M, Haagmans B . SARS-CoV-2 pathogenesis. Nat Rev Microbiol. 2022; 20(5):270-284. DOI: 10.1038/s41579-022-00713-0. View