
Robust Rank Aggregation for Gene List Integration and Meta-analysis

Overview
Journal: Bioinformatics
Specialty: Biology
Date: 2012 Jan 17
PMID: 22247279
Citations: 526
Abstract

Motivation: Continued progress in technological platforms, the availability of many published experimental datasets, and the variety of statistical methods for analyzing those data make it possible to approach the same research question with several methods simultaneously. To get the best out of all these alternatives, we need to integrate their results in an unbiased manner. Prioritized gene lists are a common way to present results in genomic data analysis. Rank aggregation methods can therefore serve as a useful and general solution for the integration task.

Results: Standard rank aggregation methods are often ill-suited for biological settings where the gene lists are inherently noisy. As a remedy, we propose a novel robust rank aggregation (RRA) method. Our method detects genes that are ranked consistently better than expected under the null hypothesis of uncorrelated inputs and assigns a significance score to each gene. The underlying probabilistic model makes the algorithm parameter-free and robust to outliers, noise and errors. Significance scores also provide a rigorous way to keep only the statistically relevant genes in the final list. These properties make our approach robust and compelling for many settings.
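
The abstract does not spell out how the significance score is computed. The sketch below is one way to illustrate the idea, assuming an order-statistic formulation. Each gene's ranks are normalized to (0, 1], so that under the null hypothesis of uncorrelated inputs they behave like uniform draws. The k-th smallest of n such draws follows a Beta(k, n-k+1) distribution, and the score is the smallest resulting tail probability, Bonferroni-corrected by n. The function names (`beta_cdf_order_stat`, `rho_score`, `aggregate`) and the treatment of missing genes as ranked last are illustrative assumptions. This is not the RobustRankAggreg API.

```python
from math import comb


def beta_cdf_order_stat(x, k, n):
    """P(k-th smallest of n uniform(0,1) draws <= x).

    This equals the Beta(k, n-k+1) CDF at x, written here as the
    binomial tail P(Bin(n, x) >= k) so only the standard library is needed.
    """
    return sum(comb(n, j) * x**j * (1 - x) ** (n - j) for j in range(k, n + 1))


def rho_score(normalized_ranks):
    """Significance-style score for one gene from its normalized ranks.

    Takes the minimum order-statistic p-value over k = 1..n and applies a
    Bonferroni correction by n, capped at 1 (assumed formulation).
    """
    r = sorted(normalized_ranks)
    n = len(r)
    p_min = min(beta_cdf_order_stat(x, k, n) for k, x in enumerate(r, start=1))
    return min(1.0, p_min * n)


def aggregate(ranked_lists):
    """Score every gene across several ranked lists, best (smallest) first."""
    universe = set().union(*ranked_lists)
    total = len(universe)
    scores = {}
    for gene in universe:
        norm = []
        for lst in ranked_lists:
            if gene in lst:
                norm.append((lst.index(gene) + 1) / total)
            else:
                # Assumption: a gene absent from a list is treated as ranked last.
                norm.append(1.0)
        scores[gene] = rho_score(norm)
    return sorted(scores.items(), key=lambda kv: kv[1])


if __name__ == "__main__":
    # Toy input: three noisy rankings of four hypothetical genes.
    lists = [["A", "B", "C", "D"], ["A", "C", "B", "D"], ["B", "A", "D", "C"]]
    for gene, score in aggregate(lists):
        print(gene, round(score, 4))
```

Because the score depends on the best-supported subset of ranks rather than on their average, a single list that ranks a gene poorly does little to change that gene's score. This is the kind of robustness to outliers the abstract describes.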

Availability: All the methods are implemented in the GNU R package RobustRankAggreg, freely available from the Comprehensive R Archive Network (http://cran.r-project.org/).

Citing Articles

Expression of ENL YEATS domain tumor mutations in nephrogenic or stromal lineage impairs kidney development.

Xue Z, Xuan H, Lau K, Su Y, Wegener M, Li K. Nat Commun. 2025; 16(1):2531.

PMID: 40087269 DOI: 10.1038/s41467-025-57926-z.


Comprehensive bioinformatics analysis reveals key hub genes linked to prognosis in multiple myeloma with drug resistance.

Chen X, Wu Y, Li Y, Chen Q, Yao L, Lin L. Medicine (Baltimore). 2025; 104(10):e41707.

PMID: 40068082 PMC: 11902958. DOI: 10.1097/MD.0000000000041707.


Identification and multi-omics analysis of essential coding and long non-coding genes in colorectal cancer.

Li Y, Meng Z, Fan C, Rong H, Xi Y, Liao Q. Biochem Biophys Rep. 2025; 41:101938.

PMID: 40034256 PMC: 11874739. DOI: 10.1016/j.bbrep.2025.101938.


PRODE recovers essential and context-essential genes through neighborhood-informed scores.

Cantore T, Gasperini P, Bevilacqua R, Ciani Y, Sinha S, Ruppin E. Genome Biol. 2025; 26(1):42.

PMID: 40022167 PMC: 11869679. DOI: 10.1186/s13059-025-03501-0.


Investigating the epigenetic landscape of symptomatic disk degeneration: a case study.

Yeater T, Kawarai Y, Lee S, Belani K, Beebe D, Sheyn D. Pain Rep. 2025; 10(2):e1237.

PMID: 39995491 PMC: 11850048. DOI: 10.1097/PR9.0000000000001237.

