RobNorm: Model-based Robust Normalization Method for Labeled Quantitative Mass Spectrometry Proteomics Data

Overview
Journal Bioinformatics
Specialty Biology
Date 2020 Oct 24
PMID 33098413
Citations 6
Abstract

Motivation: Data normalization is an important step in processing proteomics data generated in mass spectrometry experiments; it aims to reduce sample-level variation and facilitate comparisons of samples. Previously published normalization methods depend primarily on the assumption that the distribution of protein expression is similar across all samples. However, this assumption fails when the protein expression data are generated from heterogeneous samples, such as different tissue types. This led us to develop a novel data-driven normalization method that corrects systematic bias while maintaining the underlying biological heterogeneity.

Results: To robustly correct the systematic bias, we used the density-power-weight method to down-weight outliers and extended the one-dimensional robust fitting method described in previous work to our structured data. We then constructed a robustness criterion and developed a new normalization algorithm, called RobNorm. In simulation studies and in an analysis of real data from the Genotype-Tissue Expression project, we compared the performance of RobNorm against other normalization methods. We found that RobNorm exhibits the greatest reduction in systematic bias while maintaining across-tissue variation, especially for datasets from highly heterogeneous samples.
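
The idea of down-weighting outliers with density-power weights can be illustrated in one dimension. The following is a minimal sketch, not the RobNorm algorithm itself: it assumes a Gaussian model, weights each observation by the fitted density raised to a power gamma, and iterates weighted mean and variance updates to a fixed point. The function name `density_power_fit`, the parameter `gamma`, the starting values, and the simulated data are all illustrative assumptions and do not come from the paper or its repository.

```python
import numpy as np


def density_power_fit(x, gamma=0.5, n_iter=100, tol=1e-8):
    """Illustrative robust Gaussian fit using density-power weights.

    Each point gets a weight proportional to the fitted normal density
    raised to the power `gamma`, so points far from the bulk of the data
    contribute very little to the estimates.
    """
    x = np.asarray(x, dtype=float)
    # Robust starting values: median and squared (scaled) MAD.
    mu = np.median(x)
    mad = 1.4826 * np.median(np.abs(x - mu))
    sigma2 = max(mad ** 2, 1e-12)  # guard against a zero starting scale
    w = np.full(x.shape, 1.0 / x.size)

    for _ in range(n_iter):
        # Normal density to the power gamma; constant factors cancel
        # once the weights are normalized to sum to one.
        w = np.exp(-0.5 * gamma * (x - mu) ** 2 / sigma2)
        w /= w.sum()

        mu_new = np.sum(w * x)
        # Under a normal model the weighted variance shrinks by a factor
        # of 1 / (1 + gamma); multiply back to correct for that.
        sigma2_new = (1.0 + gamma) * np.sum(w * (x - mu_new) ** 2)

        converged = abs(mu_new - mu) < tol and abs(sigma2_new - sigma2) < tol
        mu, sigma2 = mu_new, sigma2_new
        if converged:
            break

    return mu, sigma2, w


# Simulated example (assumed data): 200 "regular" values plus 20 outliers.
rng = np.random.default_rng(0)
clean = rng.normal(0.0, 1.0, size=200)
outliers = rng.normal(8.0, 0.5, size=20)
x = np.concatenate([clean, outliers])

mu, sigma2, w = density_power_fit(x)
print("ordinary mean:", x.mean())
print("robust mean:  ", mu)
print("robust var:   ", sigma2)
```

In this toy setting the ordinary mean is pulled toward the outlying group, while the weighted estimate stays close to the center of the main group because the outliers receive weights near zero. Larger values of `gamma` give more robustness at some cost in efficiency when the data contain no outliers.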

Availability and Implementation: https://github.com/mwgrassgreen/RobNorm.

Supplementary Information: Supplementary data are available at Bioinformatics online.

Citing Articles

Comprehensive Overview of Bottom-Up Proteomics Using Mass Spectrometry.

Jiang Y, Rex D, Schuster D, Neely B, Rosano G, Volkmar N ACS Meas Sci Au. 2024; 4(4):338-417.

PMID: 39193565 PMC: 11348894. DOI: 10.1021/acsmeasuresciau.3c00068.


Comprehensive Overview of Bottom-Up Proteomics using Mass Spectrometry.

Jiang Y, Rex D, Schuster D, Neely B, Rosano G, Volkmar N ArXiv. 2023; .

PMID: 38013887 PMC: 10680866.


Deep Proteomics Network and Machine Learning Analysis of Human Cerebrospinal Fluid in Japanese Encephalitis Virus Infection.

Bharucha T, Gangadharan B, Kumar A, Myall A, Ayhan N, Pastorino B J Proteome Res. 2023; 22(6):1614-1629.

PMID: 37219084 PMC: 10246887. DOI: 10.1021/acs.jproteome.2c00563.


Accounting for multiple imputation-induced variability for differential analysis in mass spectrometry-based label-free quantitative proteomics.

Chion M, Carapito C, Bertrand F PLoS Comput Biol. 2022; 18(8):e1010420.

PMID: 36037245 PMC: 9462777. DOI: 10.1371/journal.pcbi.1010420.


Diagnostics and correction of batch effects in large-scale proteomic studies: a tutorial.

Čuklina J, Lee C, Williams E, Sajic T, Collins B, Rodriguez Martinez M Mol Syst Biol. 2021; 17(8):e10240.

PMID: 34432947 PMC: 8447595. DOI: 10.15252/msb.202110240.

