
A Compendium to Ensure Computational Reproducibility in High-dimensional Classification Tasks

Overview
Date 2006 May 2
PMID 16646817
Citations 30
Abstract

We present the concept and an implementation of a compendium for classifying high-dimensional data from microarray gene expression profiles. A compendium is an interactive document that bundles primary data, statistical processing methods, figures, and derived data with the textual documentation and conclusions. Because it is interactive, the reader can modify and extend each of these components. We address three questions: how strongly does the discriminatory power of a classifier depend on the algorithm used to identify it; which alternative classifiers would perform equally well; and how robust is the result? The answers are essential prerequisites for validating the classifiers and interpreting them biologically. We illustrate the approach by examining these questions for a specific breast cancer microarray data set that was first studied by Huang et al. (2003).
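
The questions posed in the abstract can be illustrated with a minimal sketch. It is not the authors' compendium or their method. It assumes synthetic data in place of the breast cancer data set, two simple stand-in classifiers (nearest centroid and 1-nearest-neighbor), and a mean-difference feature ranking. All function names and parameters (make_data, select_features, cv_error, k_features, and so on) are hypothetical. Features are selected inside each cross-validation fold, and the cross-validation is repeated over several random splits to give a rough view of how stable the error estimate is.

```python
import random
import statistics

def make_data(n_samples=40, n_features=200, n_informative=5, shift=1.5, seed=0):
    """Synthetic stand-in for expression profiles: few samples, many features."""
    rng = random.Random(seed)
    X, y = [], []
    for i in range(n_samples):
        label = i % 2
        row = [rng.gauss(0.0, 1.0) for _ in range(n_features)]
        for j in range(n_informative):
            row[j] += shift * label  # only the first few features carry signal
        X.append(row)
        y.append(label)
    return X, y

def select_features(X, y, k):
    """Rank features by absolute difference of class means (training data only)."""
    scores = []
    for j in range(len(X[0])):
        a = [row[j] for row, lab in zip(X, y) if lab == 0]
        b = [row[j] for row, lab in zip(X, y) if lab == 1]
        scores.append((abs(statistics.mean(a) - statistics.mean(b)), j))
    return [j for _, j in sorted(scores, reverse=True)[:k]]

def nearest_centroid(X_train, y_train, x):
    """Predict the label whose class centroid is closest to x."""
    best_label, best_dist = None, float("inf")
    for label in sorted(set(y_train)):
        rows = [r for r, lab in zip(X_train, y_train) if lab == label]
        centroid = [statistics.mean(col) for col in zip(*rows)]
        dist = sum((a - b) ** 2 for a, b in zip(x, centroid))
        if dist < best_dist:
            best_label, best_dist = label, dist
    return best_label

def one_nearest_neighbor(X_train, y_train, x):
    """Predict the label of the single closest training sample."""
    dists = [(sum((a - b) ** 2 for a, b in zip(x, r)), lab)
             for r, lab in zip(X_train, y_train)]
    return min(dists)[1]

def cv_error(X, y, classify, k_features=10, n_folds=5, seed=0):
    """Cross-validated error rate; feature selection is redone in every fold."""
    idx = list(range(len(X)))
    random.Random(seed).shuffle(idx)
    errors = 0
    for f in range(n_folds):
        test = idx[f::n_folds]
        held_out = set(test)
        train = [i for i in idx if i not in held_out]
        X_tr = [X[i] for i in train]
        y_tr = [y[i] for i in train]
        feats = select_features(X_tr, y_tr, k_features)
        X_tr_sel = [[row[j] for j in feats] for row in X_tr]
        for i in test:
            x = [X[i][j] for j in feats]
            errors += int(classify(X_tr_sel, y_tr, x) != y[i])
    return errors / len(X)

X, y = make_data()
results = {}
for name, clf in [("nearest centroid", nearest_centroid),
                  ("1-NN", one_nearest_neighbor)]:
    # Repeat with different random splits to see how much the estimate varies.
    errs = [cv_error(X, y, clf, seed=s) for s in range(5)]
    results[name] = errs
    print(name, "mean error:", round(statistics.mean(errs), 3),
          "sd:", round(statistics.stdev(errs), 3))
```

Selecting features inside each fold keeps the held-out samples from influencing the selection. Selecting them once on all samples would tend to make the error estimate too optimistic. Comparing the per-algorithm means and their spread across splits is one simple way to ask whether the choice of algorithm matters more than the sampling noise.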

Citing Articles

Noninvasive machine-learning models for the detection of lesion-specific ischemia in patients with stable angina with intermediate stenosis severity on coronary CT angiography.

Hamasaki H, Arimura H, Yamasaki Y, Yamamoto T, Fukata M, Matoba T Phys Eng Sci Med. 2024; .

PMID: 39739189 DOI: 10.1007/s13246-024-01503-z.


Statistical analysis of high-dimensional biomedical data: a gentle introduction to analytical goals, common approaches and challenges.

Rahnenfuhrer J, De Bin R, Benner A, Ambrogi F, Lusa L, Boulesteix A BMC Med. 2023; 21(1):182.

PMID: 37189125 PMC: 10186672. DOI: 10.1186/s12916-023-02858-y.


Patterns of risk-Using machine learning and structural neuroimaging to identify pedophilic offenders.

Popovic D, Wertz M, Geisler C, Kaufmann J, Lahteenvuo M, Lieslehto J Front Psychiatry. 2023; 14:1001085.

PMID: 37151966 PMC: 10157073. DOI: 10.3389/fpsyt.2023.1001085.


Similarities and differences between multivariate patterns of cognitive and socio-cognitive deficits in schizophrenia, bipolar disorder and related risk.

Raio A, Pergola G, Rampino A, Russo M, D'Ambrosio E, Selvaggi P Schizophrenia (Heidelb). 2023; 9(1):11.

PMID: 36801866 PMC: 9938280. DOI: 10.1038/s41537-023-00337-0.


Machine learning-based ability to classify psychosis and early stages of disease through parenting and attachment-related variables is associated with social cognition.

Antonucci L, Raio A, Pergola G, Gelao B, Papalino M, Rampino A BMC Psychol. 2021; 9(1):47.

PMID: 33757595 PMC: 7989088. DOI: 10.1186/s40359-021-00552-3.