» Articles » PMID: 34214174

Rank-in: Enabling Integrative Analysis Across Microarray and RNA-seq for Cancer

Overview
Specialty Biochemistry
Date 2021 Jul 2
PMID 34214174
Citations 27
Authors
Affiliations
Soon will be listed here.
Abstract

Though transcriptomics technologies evolve rapidly in the past decades, integrative analysis of mixed data between microarray and RNA-seq remains challenging due to the inherent variability difference between them. Here, Rank-In was proposed to correct the nonbiological effects across the two technologies, enabling freely blended data for consolidated analysis. Rank-In was rigorously validated via the public cell and tissue samples tested by both technologies. On the two reference samples of the SEQC project, Rank-In not only perfectly classified the 44 profiles but also achieved the best accuracy of 0.9 on predicting TaqMan-validated DEGs. More importantly, on 327 Glioblastoma (GBM) profiles and 248, 523 heterogeneous colon cancer profiles respectively, only Rank-In can successfully discriminate every single cancer profile from normal controls, while the others cannot. Further on different sizes of mixed seq-array GBM profiles, Rank-In can robustly reproduce a median range of DEG overlapping from 0.74 to 0.83 among top genes, whereas the others never exceed 0.72. Being the first effective method enabling mixed data of cross-technology analysis, Rank-In welcomes hybrid of array and seq profiles for integrative study on large/small, paired/unpaired and balanced/imbalanced samples, opening possibility to reduce sampling space of clinical cancer patients. Rank-In can be accessed at http://www.badd-cao.net/rank-in/index.html.

Citing Articles

Transcriptomic profiling reveals mechanism, therapeutic potential, and prognostic value of cancer stemness characteristic in nasopharyngeal carcinoma.

Chen J, Shen R, Zhu J, Wang Y, Fu L, Chen Y Funct Integr Genomics. 2025; 25(1):56.

PMID: 40053129 DOI: 10.1007/s10142-025-01561-w.


Critical role of Oas1g and STAT1 pathways in neuroinflammation: insights for Alzheimer's disease therapeutics.

Xie Z, Li L, Hou W, Fan Z, Zeng L, He L J Transl Med. 2025; 23(1):182.

PMID: 39953505 PMC: 11829366. DOI: 10.1186/s12967-025-06112-2.


Circadian gene signatures in the progression of obesity based on machine learning and Mendelian randomization analysis.

Cheng Z, Liu B, Liu X Front Nutr. 2024; 11:1407265.

PMID: 39351493 PMC: 11439728. DOI: 10.3389/fnut.2024.1407265.


Unveiling circRNA-mediated ceRNA networks in ischemic stroke by integrative analysis of multi-source gene expression profiling.

Zhang Y, Zhang X Heliyon. 2024; 10(17):e36988.

PMID: 39281538 PMC: 11402246. DOI: 10.1016/j.heliyon.2024.e36988.


Identification of hub genes associated with neutrophils in chronic rhinosinusitis with nasal polyps.

Guo Y, Sun Q, Yin J, Mou Y, Wang J, Wang Y Sci Rep. 2024; 14(1):19870.

PMID: 39191825 PMC: 11350000. DOI: 10.1038/s41598-024-70387-6.


References
1.
Wang P, Yang Y, Han W, Ma D . ImmuSort, a database on gene plasticity and electronic sorting for immune cells. Sci Rep. 2015; 5:10370. PMC: 4437374. DOI: 10.1038/srep10370. View

2.
Tang Z, Han L, Lin H, Cui J, Jia J, Low B . Derivation of stable microarray cancer-differentiating signatures using consensus scoring of multiple random sampling and gene-ranking consistency evaluation. Cancer Res. 2007; 67(20):9996-10003. DOI: 10.1158/0008-5472.CAN-07-1601. View

3.
Johnson W, Li C, Rabinovic A . Adjusting batch effects in microarray expression data using empirical Bayes methods. Biostatistics. 2006; 8(1):118-27. DOI: 10.1093/biostatistics/kxj037. View

4.
Dembele D . A Flexible Microarray Data Simulation Model. Microarrays (Basel). 2016; 2(2):115-30. PMC: 5003477. DOI: 10.3390/microarrays2020115. View

5.
Xu L, Luo H, Wang R, Wu W, Phue J, Shen R . Novel reference genes in colorectal cancer identify a distinct subset of high stage tumors and their associated histologically normal colonic tissues. BMC Med Genet. 2019; 20(1):138. PMC: 6693228. DOI: 10.1186/s12881-019-0867-y. View