» Articles » PMID: 36308688

Towards a More Accurate Differential Analysis of Multiple Imputed Proteomics Data with Mi4limma

Overview
Specialty Molecular Biology
Date 2022 Oct 29
PMID 36308688
Authors
Affiliations
Soon will be listed here.
Abstract

Imputing missing values is a common practice in label-free quantitative proteomics. Imputation replaces a missing value by a user-defined one. However, the imputation itself is not optimally considered downstream of the imputation process. In particular, imputed datasets are considered as if they had always been complete. The uncertainty due to the imputation is not properly taken into account. Hence, the mi4p package provides a more accurate statistical analysis of multiple-imputed datasets. A rigorous multiple imputation methodology is implemented, leading to a less biased estimation of parameters and their variability, thanks to Rubin's rules. The imputation-based peptide's intensities' variance estimator is then moderated using Bayesian hierarchical models. This estimator is finally included in moderated t-test statistics to provide differential analyses results.

Citing Articles

Unravelling the sexual developmental biology of , a model for comparative coccidian parasite studies.

Cruz-Bustos T, Dolezal M, Feix A, Ruttkowski B, Hummel K, Razzazi-Fazeli E Front Cell Infect Microbiol. 2023; 13:1271731.

PMID: 37953800 PMC: 10635411. DOI: 10.3389/fcimb.2023.1271731.

References
1.
Chang C, Xu K, Guo C, Wang J, Yan Q, Zhang J . PANDA-view: an easy-to-use tool for statistical analysis and visualization of quantitative proteomics data. Bioinformatics. 2018; 34(20):3594-3596. PMC: 6184437. DOI: 10.1093/bioinformatics/bty408. View

2.
Choi M, Chang C, Clough T, Broudy D, Killeen T, MacLean B . MSstats: an R package for statistical analysis of quantitative mass spectrometry-based proteomic experiments. Bioinformatics. 2014; 30(17):2524-6. DOI: 10.1093/bioinformatics/btu305. View

3.
Wieczorek S, Combes F, Lazar C, Giai Gianetto Q, Gatto L, Dorffer A . DAPAR & ProStaR: software to perform statistical analyses in quantitative discovery proteomics. Bioinformatics. 2016; 33(1):135-136. PMC: 5408771. DOI: 10.1093/bioinformatics/btw580. View

4.
White I, Royston P, Wood A . Multiple imputation using chained equations: Issues and guidance for practice. Stat Med. 2011; 30(4):377-99. DOI: 10.1002/sim.4067. View

5.
Phipson B, Lee S, Majewski I, Alexander W, Smyth G . ROBUST HYPERPARAMETER ESTIMATION PROTECTS AGAINST HYPERVARIABLE GENES AND IMPROVES POWER TO DETECT DIFFERENTIAL EXPRESSION. Ann Appl Stat. 2017; 10(2):946-963. PMC: 5373812. DOI: 10.1214/16-AOAS920. View