» Articles » PMID: 27899656

IDP-ASE: Haplotyping and Quantifying Allele-specific Expression at the Gene and Gene Isoform Level by Hybrid Sequencing

Overview
Specialty Biochemistry
Date 2016 Dec 1
PMID 27899656
Citations 28
Authors
Affiliations
Soon will be listed here.
Abstract

Allele-specific expression (ASE) is a fundamental problem in studying gene regulation and diploid transcriptome profiles, with two key challenges: (i) haplotyping and (ii) estimation of ASE at the gene isoform level. Existing ASE analysis methods are limited by a dependence on haplotyping from laborious experiments or extra genome/family trio data. In addition, there is a lack of methods for gene isoform level ASE analysis. We developed a tool, IDP-ASE, for full ASE analysis. By innovative integration of Third Generation Sequencing (TGS) long reads with Second Generation Sequencing (SGS) short reads, the accuracy of haplotyping and ASE quantification at the gene and gene isoform level was greatly improved as demonstrated by the gold standard data GM12878 data and semi-simulation data. In addition to methodology development, applications of IDP-ASE to human embryonic stem cells and breast cancer cells indicate that the imbalance of ASE and non-uniformity of gene isoform ASE is widespread, including tumorigenesis relevant genes and pluripotency markers. These results show that gene isoform expression and allele-specific expression cooperate to provide high diversity and complexity of gene regulation and expression, highlighting the importance of studying ASE at the gene isoform level. Our study provides a robust bioinformatics solution to understand ASE using RNA sequencing data only.

Citing Articles

Long-read sequencing of an advanced cancer cohort resolves rearrangements, unravels haplotypes, and reveals methylation landscapes.

ONeill K, Pleasance E, Fan J, Akbari V, Chang G, Dixon K Cell Genom. 2024; 4(11):100674.

PMID: 39406235 PMC: 11605692. DOI: 10.1016/j.xgen.2024.100674.


Characterizing the allele-specific gene expression landscape in high hyperdiploid acute lymphoblastic leukemia with BASE.

Andersson J, Aydin E, Gunnarsson R, Lilljebjorn H, Fioretos T, Johansson B Sci Rep. 2024; 14(1):23181.

PMID: 39369032 PMC: 11455916. DOI: 10.1038/s41598-024-73743-8.


Single-cell long-read targeted sequencing reveals transcriptional variation in ovarian cancer.

Byrne A, Le D, Sereti K, Menon H, Vaidya S, Patel N Nat Commun. 2024; 15(1):6916.

PMID: 39134520 PMC: 11319652. DOI: 10.1038/s41467-024-51252-6.


Detecting haplotype-specific transcript variation in long reads with FLAIR2.

Tang A, Felton C, Hrabeta-Robinson E, Volden R, Vollmers C, Brooks A Genome Biol. 2024; 25(1):173.

PMID: 38956576 PMC: 11218413. DOI: 10.1186/s13059-024-03301-y.


Long-read RNA-seq demarcates - and -directed alternative RNA splicing.

Quinones-Valdez G, Amoah K, Xiao X bioRxiv. 2024; .

PMID: 38915585 PMC: 11195283. DOI: 10.1101/2024.06.14.599101.


References
1.
Au K, Underwood J, Lee L, Wong W . Improving PacBio long read accuracy by short read alignment. PLoS One. 2012; 7(10):e46679. PMC: 3464235. DOI: 10.1371/journal.pone.0046679. View

2.
Nik-Zainal S, Davies H, Staaf J, Ramakrishna M, Glodzik D, Zou X . Landscape of somatic mutations in 560 breast cancer whole-genome sequences. Nature. 2016; 534(7605):47-54. PMC: 4910866. DOI: 10.1038/nature17676. View

3.
Romanel A, Lago S, Prandi D, Sboner A, Demichelis F . ASEQ: fast allele-specific studies from next-generation sequencing data. BMC Med Genomics. 2015; 8:9. PMC: 4363342. DOI: 10.1186/s12920-015-0084-2. View

4.
Baker C, Kajita S, Walker M, Saxl R, Raghupathy N, Choi K . PRDM9 drives evolutionary erosion of hotspots in Mus musculus through haplotype-specific initiation of meiotic recombination. PLoS Genet. 2015; 11(1):e1004916. PMC: 4287450. DOI: 10.1371/journal.pgen.1004916. View

5.
Bansal V, Halpern A, Axelrod N, Bafna V . An MCMC algorithm for haplotype assembly from whole-genome sequence data. Genome Res. 2008; 18(8):1336-46. PMC: 2493424. DOI: 10.1101/gr.077065.108. View