» Articles » PMID: 19616692

Key Wavelengths Screening Using Competitive Adaptive Reweighted Sampling Method for Multivariate Calibration

Overview
Journal Anal Chim Acta
Publisher Elsevier
Specialty Chemistry
Date 2009 Jul 21
PMID 19616692
Citations 157
Authors
Affiliations
Soon will be listed here.
Abstract

By employing the simple but effective principle 'survival of the fittest' on which Darwin's Evolution Theory is based, a novel strategy for selecting an optimal combination of key wavelengths of multi-component spectral data, named competitive adaptive reweighted sampling (CARS), is developed. Key wavelengths are defined as the wavelengths with large absolute coefficients in a multivariate linear regression model, such as partial least squares (PLS). In the present work, the absolute values of regression coefficients of PLS model are used as an index for evaluating the importance of each wavelength. Then, based on the importance level of each wavelength, CARS sequentially selects N subsets of wavelengths from N Monte Carlo (MC) sampling runs in an iterative and competitive manner. In each sampling run, a fixed ratio (e.g. 80%) of samples is first randomly selected to establish a calibration model. Next, based on the regression coefficients, a two-step procedure including exponentially decreasing function (EDF) based enforced wavelength selection and adaptive reweighted sampling (ARS) based competitive wavelength selection is adopted to select the key wavelengths. Finally, cross validation (CV) is applied to choose the subset with the lowest root mean square error of CV (RMSECV). The performance of the proposed procedure is evaluated using one simulated dataset together with one near infrared dataset of two properties. The results reveal an outstanding characteristic of CARS that it can usually locate an optimal combination of some key wavelengths which are interpretable to the chemical property of interest. Additionally, our study shows that better prediction is obtained by CARS when compared to full spectrum PLS modeling, Monte Carlo uninformative variable elimination (MC-UVE) and moving window partial least squares regression (MWPLSR).

Citing Articles

Deep learning and hyperspectral features for seedling stage identification of barnyard grass in paddy field.

Tan S, Xie Q, Zhu W, Deng Y, Zhu L, Yu X Front Plant Sci. 2025; 16:1507442.

PMID: 39990719 PMC: 11842386. DOI: 10.3389/fpls.2025.1507442.


Constructing an origin discrimination model of japonica rice in Heilongjiang Province based on confocal microscopy Raman spectroscopy technology.

Zhang G, Liu J, Li Z, Li N, Zhang D Sci Rep. 2025; 15(1):5848.

PMID: 39966446 PMC: 11836377. DOI: 10.1038/s41598-024-83894-3.


Improving Soil Heavy Metal Lead Inversion Through Combined Band Selection Methods: A Case Study in Gejiu City, China.

He P, Cheng X, Wen X, Cao Y, Chen Y Sensors (Basel). 2025; 25(3).

PMID: 39943324 PMC: 11819662. DOI: 10.3390/s25030684.


A machine learning based variable selection algorithm for binary classification of perinatal mortality.

Sadiq M, Shah R PLoS One. 2025; 20(1):e0315498.

PMID: 39821154 PMC: 11737800. DOI: 10.1371/journal.pone.0315498.


Monitoring the Concentrations of Na, Mg, Ca, Cu, Fe, and K in at Different Growth Stages by NIR Spectroscopy Coupled with Chemometrics.

Wei S, Huang J, Niu Y, Tong H, Su L, Zhang X Foods. 2025; 14(1.

PMID: 39796412 PMC: 11719690. DOI: 10.3390/foods14010122.