» Articles » PMID: 35585485

Evaluation of Statistical Approaches for Association Testing in Noisy Drug Screening Data

Overview
Publisher Biomed Central
Specialty Biology
Date 2022 May 18
PMID 35585485
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Identifying associations among biological variables is a major challenge in modern quantitative biological research, particularly given the systemic and statistical noise endemic to biological systems. Drug sensitivity data has proven to be a particularly challenging field for identifying associations to inform patient treatment.

Results: To address this, we introduce two semi-parametric variations on the commonly used concordance index: the robust concordance index and the kernelized concordance index (rCI, kCI), which incorporate measurements about the noise distribution from the data. We demonstrate that common statistical tests applied to the concordance index and its variations fail to control for false positives, and introduce efficient implementations to compute p-values using adaptive permutation testing. We then evaluate the statistical power of these coefficients under simulation and compare with Pearson and Spearman correlation coefficients. Finally, we evaluate the various statistics in matching drugs across pharmacogenomic datasets.

Conclusions: We observe that the rCI and kCI are better powered than the concordance index in simulation and show some improvement on real data. Surprisingly, we observe that the Pearson correlation was the most robust to measurement noise among the different metrics.

Citing Articles

Machine learning-driven exploration of drug therapies for triple-negative breast cancer treatment.

Kaushik A, Zhao Z Front Mol Biosci. 2023; 10:1215204.

PMID: 37602329 PMC: 10436744. DOI: 10.3389/fmolb.2023.1215204.


Ranking Breast Cancer Drugs and Biomarkers Identification Using Machine Learning and Pharmacogenomics.

Mehmood A, Nawab S, Jin Y, Hassan H, Kaushik A, Wei D ACS Pharmacol Transl Sci. 2023; 6(3):399-409.

PMID: 36926455 PMC: 10012252. DOI: 10.1021/acsptsci.2c00212.


Reassessing pharmacogenomic cell sensitivity with multilevel statistical models.

Ploenzke M, Irizarry R Biostatistics. 2022; 24(4):901-921.

PMID: 35277956 PMC: 10583722. DOI: 10.1093/biostatistics/kxac010.


PharmacoDB 2.0: improving scalability and transparency of in vitro pharmacogenomics analysis.

Feizi N, Kadambat Nair S, Smirnov P, Beri G, Eeles C, Esfahani P Nucleic Acids Res. 2021; 50(D1):D1348-D1357.

PMID: 34850112 PMC: 8728279. DOI: 10.1093/nar/gkab1084.

References
1.
Hafner M, Heiser L, Williams E, Niepel M, Wang N, Korkola J . Quantification of sensitivity and resistance of breast cancer cell lines to anti-cancer drugs using GR metrics. Sci Data. 2017; 4:170166. PMC: 5674849. DOI: 10.1038/sdata.2017.166. View

2.
Haibe-Kains B, El-Hachem N, Birkbak N, Jin A, Beck A, Aerts H . Inconsistency in large pharmacogenomic studies. Nature. 2013; 504(7480):389-93. PMC: 4237165. DOI: 10.1038/nature12831. View

3.
Mammoliti A, Smirnov P, Nakano M, Safikhani Z, Eeles C, Seo H . Orchestrating and sharing large multimodal data for transparent and reproducible research. Nat Commun. 2021; 12(1):5797. PMC: 8490371. DOI: 10.1038/s41467-021-25974-w. View

4.
Pencina M, DAgostino R . Overall C as a measure of discrimination in survival analysis: model specific population value and confidence interval estimation. Stat Med. 2004; 23(13):2109-23. DOI: 10.1002/sim.1802. View

5.
Bishara A, Hittner J . Testing the significance of a correlation with nonnormal data: comparison of Pearson, Spearman, transformation, and resampling approaches. Psychol Methods. 2012; 17(3):399-417. DOI: 10.1037/a0028087. View