Gene Identification Signature (GIS) Analysis for Transcriptome Characterization and Genome Annotation
Overview
Pathology
Authors
Affiliations
We have developed a DNA tag sequencing and mapping strategy called gene identification signature (GIS) analysis, in which 5' and 3' signatures of full-length cDNAs are accurately extracted into paired-end ditags (PETs) that are concatenated for efficient sequencing and mapped to genome sequences to demarcate the transcription boundaries of every gene. GIS analysis is potentially 30-fold more efficient than standard cDNA sequencing approaches for transcriptome characterization. We demonstrated this approach with 116,252 PET sequences derived from mouse embryonic stem cells. Initial analysis of this dataset identified hundreds of previously uncharacterized transcripts, including alternative transcripts of known genes. We also uncovered several intergenically spliced and unusual fusion transcripts, one of which was confirmed as a trans-splicing event and was differentially expressed. The concept of paired-end ditagging described here for transcriptome analysis can also be applied to whole-genome analysis of cis-regulatory and other DNA elements and represents an important technological advance for genome annotation.
Hu X, Fan Y, Mao C, Chen H, Wang Q Front Microbiol. 2023; 14:1111794.
PMID: 36819037 PMC: 9936982. DOI: 10.3389/fmicb.2023.1111794.
Non-coding antisense transcripts: fine regulation of gene expression in cancer.
Santos F, Capela A, Mateus F, Nobrega-Pereira S, de Jesus B Comput Struct Biotechnol J. 2022; 20:5652-5660.
PMID: 36284703 PMC: 9579725. DOI: 10.1016/j.csbj.2022.10.009.
RetroScan: An Easy-to-Use Pipeline for Retrocopy Annotation and Visualization.
Wei Z, Sun J, Li Q, Yao T, Zeng H, Wang Y Front Genet. 2021; 12:719204.
PMID: 34484306 PMC: 8415311. DOI: 10.3389/fgene.2021.719204.
Moretto F, Wood N, Chia M, Li C, Luscombe N, van Werven F Cell Rep. 2021; 34(3):108643.
PMID: 33472063 PMC: 7816125. DOI: 10.1016/j.celrep.2020.108643.
Chia M, Li C, Marques S, Pelechano V, Luscombe N, van Werven F Genome Biol. 2021; 22(1):34.
PMID: 33446241 PMC: 7807719. DOI: 10.1186/s13059-020-02245-3.