» Articles » PMID: 35818973

Avoiding C-hacking when Evaluating Survival Distribution Predictions with Discrimination Measures

Overview
Journal Bioinformatics
Specialty Biology
Date 2022 Jul 12
PMID 35818973
Authors
Affiliations
Soon will be listed here.
Abstract

Motivation: In this article, we consider how to evaluate survival distribution predictions with measures of discrimination. This is non-trivial as discrimination measures are the most commonly used in survival analysis and yet there is no clear method to derive a risk prediction from a distribution prediction. We survey methods proposed in literature and software and consider their respective advantages and disadvantages.

Results: Whilst distributions are frequently evaluated by discrimination measures, we find that the method for doing so is rarely described in the literature and often leads to unfair comparisons or 'C-hacking'. We demonstrate by example how simple it can be to manipulate results and use this to argue for better reporting guidelines and transparency in the literature. We recommend that machine learning survival analysis software implements clear transformations between distribution and risk predictions in order to allow more transparent and accessible model evaluation.

Availability And Implementation: The code used in the final experiment is available at https://github.com/RaphaelS1/distribution_discrimination.

Citing Articles

Penalized landmark supermodels (penLM) for dynamic prediction for time-to-event outcomes in high-dimensional data.

Fries A, Choi E, Han S BMC Med Res Methodol. 2025; 25(1):22.

PMID: 39871161 PMC: 11771018. DOI: 10.1186/s12874-024-02418-9.


Does combining numerous data types in multi-omics data improve or hinder performance in survival prediction? Insights from a large-scale benchmark study.

Li Y, Herold T, Mansmann U, Hornung R BMC Med Inform Decis Mak. 2024; 24(1):244.

PMID: 39223659 PMC: 11370316. DOI: 10.1186/s12911-024-02642-9.


Tutorial on survival modeling with applications to omics data.

Zhao Z, Zobolas J, Zucknick M, Aittokallio T Bioinformatics. 2024; 40(3).

PMID: 38445722 PMC: 10973942. DOI: 10.1093/bioinformatics/btae132.


Systematic comparison of multi-omics survival models reveals a widespread lack of noise resistance.

Wissel D, Rowson D, Boeva V Cell Rep Methods. 2023; 3(4):100461.

PMID: 37159669 PMC: 10162996. DOI: 10.1016/j.crmeth.2023.100461.

References
1.
Mantel N, Bohidar N, CIMINERA J . Mantel-Haenszel analyses of litter-matched time-to-response data, with modifications for recovery of interlitter information. Cancer Res. 1977; 37(11):3863-8. View

2.
Spooner A, Chen E, Sowmya A, Sachdev P, Kochan N, Trollor J . A comparison of machine learning methods for survival analysis of high-dimensional clinical data for dementia prediction. Sci Rep. 2020; 10(1):20410. PMC: 7683682. DOI: 10.1038/s41598-020-77220-w. View

3.
Sonabend R, Kiraly F, Bender A, Bischl B, Lang M . mlr3proba: an R package for machine learning in survival analysis. Bioinformatics. 2021; 37(17):2789-2791. PMC: 8428574. DOI: 10.1093/bioinformatics/btab039. View

4.
Korn E, Simon R . Measures of explained variation for survival data. Stat Med. 1990; 9(5):487-503. DOI: 10.1002/sim.4780090503. View

5.
Collins G, de Groot J, Dutton S, Omar O, Shanyinde M, Tajar A . External validation of multivariable prediction models: a systematic review of methodological conduct and reporting. BMC Med Res Methodol. 2014; 14:40. PMC: 3999945. DOI: 10.1186/1471-2288-14-40. View