» Articles » PMID: 20823336

Mass Spectrometry Data Processing Using Zero-crossing Lines in Multi-scale of Gaussian Derivative Wavelet

Overview
Journal Bioinformatics
Specialty Biology
Date 2010 Sep 9
PMID 20823336
Citations 10
Authors
Affiliations
Soon will be listed here.
Abstract

Motivation: Peaks are the key information in mass spectrometry (MS) which has been increasingly used to discover diseases-related proteomic patterns. Peak detection is an essential step for MS-based proteomic data analysis. Recently, several peak detection algorithms have been proposed. However, in these algorithms, there are three major deficiencies: (i) because the noise is often removed, the true signal could also be removed; (ii) baseline removal step may get rid of true peaks and create new false peaks; (iii) in peak quantification step, a threshold of signal-to-noise ratio (SNR) is usually used to remove false peaks; however, noise estimations in SNR calculation are often inaccurate in either time or wavelet domain. In this article, we propose new algorithms to solve these problems. First, we use bivariate shrinkage estimator in stationary wavelet domain to avoid removing true peaks in denoising step. Second, without baseline removal, zero-crossing lines in multi-scale of derivative Gaussian wavelets are investigated with mixture of Gaussian to estimate discriminative parameters of peaks. Third, in quantification step, the frequency, SD, height and rank of peaks are used to detect both high and small energy peaks with robustness to noise.

Results: We propose a novel Gaussian Derivative Wavelet (GDWavelet) method to more accurately detect true peaks with a lower false discovery rate than existing methods. The proposed GDWavelet method has been performed on the real Surface-Enhanced Laser Desorption/Ionization Time-Of-Flight (SELDI-TOF) spectrum with known polypeptide positions and on two synthetic data with Gaussian and real noise. All experimental results demonstrate that our method outperforms other commonly used methods. The standard receiver operating characteristic (ROC) curves are used to evaluate the experimental results.

Availability: http://ranger.uta.edu/~heng/MS/GDWavelet.html or http://www.naaan.org/nhanguyen/archive.htm.

Citing Articles

TIHI Toolkit: A Peak Finder and Analyzer for Spectroscopic Data.

Han K, Boziki A, Tkatchenko A, Berryman J ACS Omega. 2024; 9(50):49397-49410.

PMID: 39713663 PMC: 11656381. DOI: 10.1021/acsomega.4c06830.


Current approaches and outstanding challenges of functional annotation of metabolites: a comprehensive review.

Nguyen Q, Nguyen H, Oh E, Nguyen T Brief Bioinform. 2024; 25(6).

PMID: 39397425 PMC: 11471905. DOI: 10.1093/bib/bbae498.


Composite Multidimensional Ion Mobility-Mass Spectrometry for Improved Differentiation of Stereochemical Modifications.

Xu X, Han L, Zheng Z, Zhao R, Li L, Shao X Anal Chem. 2023; 95(4):2221-2228.

PMID: 36635260 PMC: 10276620. DOI: 10.1021/acs.analchem.2c03522.


Joint Bounding of Peaks Across Samples Improves Differential Analysis in Mass Spectrometry-Based Metabolomics.

Myint L, Kleensang A, Zhao L, Hartung T, Hansen K Anal Chem. 2017; 89(6):3517-3523.

PMID: 28221771 PMC: 5362739. DOI: 10.1021/acs.analchem.6b04719.


Peptide Peak Detection for Low Resolution MALDI-TOF Mass Spectrometry.

Yao J, Utsunomiya S, Kajihara S, Tabata T, Aoshima K, Oda Y Mass Spectrom (Tokyo). 2016; 3(1):A0030.

PMID: 26819872 PMC: 4306743. DOI: 10.5702/massspectrometry.A0030.


References
1.
Wong J, Cagney G, Cartwright H . SpecAlign--processing and alignment of mass spectra datasets. Bioinformatics. 2005; 21(9):2088-90. DOI: 10.1093/bioinformatics/bti300. View

2.
Nguyen N, Huang H, Oraintara S, Vo A . Stationary wavelet packet transform and dependent laplacian bivariate shrinkage estimator for array-CGH data smoothing. J Comput Biol. 2010; 17(2):139-52. DOI: 10.1089/cmb.2009.0013. View

3.
Yuille A, Poggio T . Scaling theorems for zero crossings. IEEE Trans Pattern Anal Mach Intell. 2011; 8(1):15-25. DOI: 10.1109/tpami.1986.4767748. View

4.
Myers C, Dunham M, Kung S, Troyanskaya O . Accurate detection of aneuploidies in array CGH and gene expression microarray data. Bioinformatics. 2004; 20(18):3533-43. DOI: 10.1093/bioinformatics/bth440. View

5.
Du P, Kibbe W, Lin S . Improved peak detection in mass spectrum by incorporating continuous wavelet transform-based pattern matching. Bioinformatics. 2006; 22(17):2059-65. DOI: 10.1093/bioinformatics/btl355. View