
A Systematic Comparison and Evaluation of Biclustering Methods for Gene Expression Data

Overview
Journal Bioinformatics
Specialty Biology
Date 2006 Feb 28
PMID 16500941
Citations 192
Abstract

Motivation: In recent years, there have been various efforts to overcome the limitations of standard clustering approaches for the analysis of gene expression data by grouping genes and samples simultaneously. The underlying concept, often referred to as biclustering, makes it possible to identify sets of genes that share compatible expression patterns across subsets of samples, and its usefulness has been demonstrated for different organisms and datasets. Several biclustering methods have been proposed in the literature; however, it is not clear how the different techniques compare with each other with respect to the biological relevance of the clusters or to other characteristics such as robustness and sensitivity to noise. Accordingly, no guidelines concerning the choice of biclustering method are currently available.

Results: First, this paper provides a methodology for comparing and validating biclustering methods that includes a simple binary reference model. Although this model captures the essential features of most biclustering approaches, it is still simple enough that all optimal groupings can be determined exactly; to this end, we propose a fast divide-and-conquer algorithm (Bimax). Second, we evaluate the performance of five salient biclustering algorithms together with the reference model and a hierarchical clustering method on various synthetic and real datasets for Saccharomyces cerevisiae and Arabidopsis thaliana. The comparison reveals that (1) biclustering in general has advantages over a conventional hierarchical clustering approach, (2) there are considerable performance differences between the tested methods and (3) even the simple reference model delivers relevant patterns in all considered settings.
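
To make the binary reference model more concrete, the following is a minimal sketch and not the Bimax algorithm itself. It assumes the expression matrix has already been discretized to 0/1 values, and it treats a bicluster as a set of rows (genes) and columns (samples) whose submatrix contains only 1s and which cannot be extended by another row or column. The function name `maximal_biclusters` and the toy matrix are illustrative assumptions; the brute-force enumeration is exponential in the number of rows and is only suitable for tiny inputs, whereas the abstract describes a fast divide-and-conquer approach for the same task.

```python
from itertools import combinations


def maximal_biclusters(matrix):
    """Brute-force enumeration of inclusion-maximal all-ones submatrices.

    Illustrative only: this is NOT Bimax. It tries every non-empty subset
    of rows, so it is exponential in the number of rows.
    """
    n_rows = len(matrix)
    n_cols = len(matrix[0]) if matrix else 0
    # For each row (gene), the set of columns (samples) holding a 1.
    row_cols = [
        frozenset(j for j in range(n_cols) if matrix[i][j])
        for i in range(n_rows)
    ]
    found = set()
    for size in range(1, n_rows + 1):
        for rows in combinations(range(n_rows), size):
            # Columns shared by every row in the candidate row set.
            cols = frozenset.intersection(*(row_cols[i] for i in rows))
            if not cols:
                continue
            # Extend the row set to every row that covers those columns,
            # which makes the (rows, columns) pair inclusion-maximal.
            closed_rows = frozenset(
                i for i in range(n_rows) if cols <= row_cols[i]
            )
            found.add((closed_rows, cols))
    return sorted((sorted(r), sorted(c)) for r, c in found)


# Hypothetical 4 genes x 4 samples binary matrix (1 = "expressed").
example = [
    [1, 1, 0, 0],
    [1, 1, 1, 0],
    [0, 1, 1, 1],
    [0, 0, 1, 1],
]

for genes, samples in maximal_biclusters(example):
    print("genes", genes, "samples", samples)
```

Each reported pair is a group of genes that are all "on" in the same group of samples, which is the kind of pattern the reference model is meant to capture exactly.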

Citing Articles

A personalized reinforcement learning recommendation algorithm using bi-clustering techniques.

Waqar M, Ayub M PLoS One. 2025; 20(2):e0315533.

PMID: 39977407 PMC: 11841880. DOI: 10.1371/journal.pone.0315533.


Online-adjusted evolutionary biclustering algorithm to identify significant modules in gene expression data.

Galindo-Hernandez R, Rodriguez-Vazquez K, Galan-Vasquez E, Hernandez Castellanos C Brief Bioinform. 2025; 26(1).

PMID: 39749664 PMC: 11695933. DOI: 10.1093/bib/bbae681.


Uncovering hidden gene-trait patterns through biclustering analysis of the UK Biobank.

Pividori M, Sadeeq S, Krishnan A, Stranger B, Gignoux C bioRxiv. 2024.

PMID: 39605717 PMC: 11601405. DOI: 10.1101/2024.11.08.622657.


A parameter free relative density based biclustering method for identifying non-linear feature relations.

Jain N, Ghosh S, Ghosh A Heliyon. 2024; 10(15):e34736.

PMID: 39157398 PMC: 11327522. DOI: 10.1016/j.heliyon.2024.e34736.


Biclustering data analysis: a comprehensive survey.

Castanho E, Aidos H, Madeira S Brief Bioinform. 2024; 25(4).

PMID: 39007596 PMC: 11247412. DOI: 10.1093/bib/bbae342.