Accurate Estimation of Expression Levels of Homologous Genes in RNA-seq Experiments

Overview
Journal: J Comput Biol
Date: 2011 Mar 10
PMID: 21385047
Citations: 24
Abstract

Next-generation high-throughput sequencing (NGS) is poised to replace array-based technologies as the experiment of choice for measuring RNA expression levels. Several groups have demonstrated the power of this new approach (RNA-seq), making significant and novel contributions and simultaneously proposing methodologies for the analysis of RNA-seq data. In a typical experiment, millions of short sequences (reads) are sampled from RNA extracts and mapped back to a reference genome. The number of reads mapping to each gene is used as a proxy for its corresponding RNA concentration. A significant challenge in analyzing RNA expression of homologous genes is the large fraction of reads that map to multiple locations in the reference genome. Currently, these reads are either dropped from the analysis or a naive algorithm is used to estimate their underlying distribution. In this work, we present a rigorous alternative for handling the reads generated in an RNA-seq experiment within a probabilistic model for RNA-seq data; we develop maximum likelihood-based methods for estimating the model parameters. In contrast to previous methods, our model takes into account the fact that the DNA of the sequenced individual is not a perfect copy of the reference sequence. We show with both simulated and real RNA-seq data that our new method improves the accuracy and power of RNA-seq experiments.
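
To make the multi-mapping problem concrete, the following is a minimal sketch of a generic expectation-maximization (EM) scheme that splits reads fractionally among the genes they align to and iterates toward a maximum likelihood estimate of relative abundance. It is not the model from the paper: the function name `em_multiread_abundance` and the input layout are hypothetical, and the sketch assumes equal gene lengths, error-free alignments, and no differences between the sequenced individual and the reference.

```python
from collections import defaultdict


def em_multiread_abundance(read_hits, n_iter=200, tol=1e-10):
    """Estimate relative gene abundances from uniquely and multiply mapped reads.

    read_hits: list of lists; read_hits[i] holds the distinct gene names that
               read i aligns to (hypothetical input layout, assumed here).
    Returns a dict mapping gene -> estimated fraction of reads from that gene.
    """
    # Reads with no alignment carry no information in this simplified model.
    read_hits = [hits for hits in read_hits if hits]
    genes = sorted({g for hits in read_hits for g in hits})
    n_reads = len(read_hits)

    # Start from a uniform guess over all genes.
    theta = {g: 1.0 / len(genes) for g in genes}

    for _ in range(n_iter):
        # E-step: assign each read to its candidate genes in proportion
        # to the current abundance estimates.
        counts = defaultdict(float)
        for hits in read_hits:
            total = sum(theta[g] for g in hits)
            for g in hits:
                counts[g] += theta[g] / total

        # M-step: the new abundance is the expected share of reads per gene.
        new_theta = {g: counts[g] / n_reads for g in genes}

        # Stop once the estimates no longer change appreciably.
        delta = max(abs(new_theta[g] - theta[g]) for g in genes)
        theta = new_theta
        if delta < tol:
            break

    return theta


# Toy data: 6 reads unique to gene A, 2 unique to gene B, 4 mapping to both.
toy_reads = [["A"]] * 6 + [["B"]] * 2 + [["A", "B"]] * 4
toy_estimate = em_multiread_abundance(toy_reads)
```

On this toy input the estimates converge toward 0.75 for gene A and 0.25 for gene B, whereas discarding the four ambiguous reads would use only 8 of the 12 reads. A model in the spirit of the abstract would additionally weight each candidate alignment by its likelihood under sequencing error and individual-specific variation, which this sketch deliberately leaves out.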

Citing Articles

A junction coverage compatibility score to quantify the reliability of transcript abundance estimates and annotation catalogs.

Soneson C, Love M, Patro R, Hussain S, Malhotra D, Robinson M. Life Sci Alliance. 2019; 2(1).

PMID: 30655364 PMC: 6337739. DOI: 10.26508/lsa.201800175.


miR-MaGiC improves quantification accuracy for small RNA-seq.

Russell P, Vestal B, Shi W, Rudra P, Dowell R, Radcliffe R. BMC Res Notes. 2018; 11(1):296.

PMID: 29764489 PMC: 5952827. DOI: 10.1186/s13104-018-3418-2.


Evaluation of Bioinformatics Approaches for Next-Generation Sequencing Analysis of microRNAs with a Toxicogenomics Study Design.

Bisgin H, Gong B, Wang Y, Tong W. Front Genet. 2018; 9:22.

PMID: 29467792 PMC: 5808213. DOI: 10.3389/fgene.2018.00022.


Upregulated WEE1 protects endothelial cells of colorectal cancer liver metastases.

Webster P, Littlejohns A, Gaunt H, Young R, Rode B, Ritchie J. Oncotarget. 2017; 8(26):42288-42299.

PMID: 28178688 PMC: 5522067. DOI: 10.18632/oncotarget.15039.


Efficient Approach to Correct Read Alignment for Pseudogene Abundance Estimates.

Ju C, Zhao Z, Wang W. IEEE/ACM Trans Comput Biol Bioinform. 2016; 14(3):522-533.

PMID: 27429446 PMC: 5514313. DOI: 10.1109/TCBB.2016.2591533.