Allele-specific Expression Analysis Methods for High-density SNP Microarray Data
Overview
Affiliations
Motivation: In the past decade, a number of technologies to quantify allele-specific expression (ASE) in a genome-wide manner have become available to researchers. We investigate the application of single-nucleotide polymorphism (SNP) microarrays to this task, exploring data obtained from both cell lines and primary tissue for which both RNA and DNA profiles are available.
Results: We analyze data from two experiments that make use of high-density Illumina Infinium II genotyping arrays to measure ASE. We first preprocess each data set, which involves removal of outlier samples, careful normalization and a two-step filtering procedure to remove SNPs that show no evidence of expression in the samples being analyzed and calls that are clear genotyping errors. We then compare three different tests for detecting ASE, one of which has been previously published and two novel approaches. These tests vary at the level at which they operate (per SNP per individual or per SNP) and in the input data they require. Using SNPs from imprinted genes as true positives for ASE, we observe varying sensitivity for the different testing procedures that improves with increasing sample size. Methods that rely on RNA signal alone were found to perform best across a range of metrics. The top ranked SNPs recovered by all methods appear to be reasonable candidates for ASE.
Availability And Implementation: Analysis was carried out in R (http://www.R-project.org/) using existing functions.
Xavier J, Magno R, Russell R, de Almeida B, Jacinta-Fernandes A, Besouro-Duarte A Sci Rep. 2024; 14(1):22526.
PMID: 39341862 PMC: 11438911. DOI: 10.1038/s41598-024-72163-y.
Correia L, Magno R, Xavier J, de Almeida B, Duarte I, Esteves F NPJ Breast Cancer. 2022; 8(1):71.
PMID: 35676284 PMC: 9177727. DOI: 10.1038/s41523-022-00435-9.
Allele-specific miRNA-binding analysis identifies candidate target genes for breast cancer risk.
Jacinta-Fernandes A, Xavier J, Magno R, Lage J, Maia A NPJ Genom Med. 2020; 5:4.
PMID: 32128252 PMC: 7018948. DOI: 10.1038/s41525-019-0112-9.
Zhao C, Xie S, Wu H, Luan Y, Hu S, Ni J Sci Rep. 2019; 9(1):6334.
PMID: 31004110 PMC: 6474871. DOI: 10.1038/s41598-019-42815-5.
Whole transcriptome RNA-Seq allelic expression in human brain.
Smith R, Webb A, Papp A, Newman L, Handelman S, Suhy A BMC Genomics. 2013; 14:571.
PMID: 23968248 PMC: 3765493. DOI: 10.1186/1471-2164-14-571.