Geographic Authentication of Leaves Using Multivariate Analysis and Preliminary Study on the Compositional Response to Environment
Overview
Affiliations
To explore the influences of different cultivated areas on the chemical profiles of leaves (EUL) and rapidly authenticate its geographical origins, 187 samples from 13 provinces in China were systematically investigated using three data fusion strategies (low, mid, and high level) combined with two discrimination model algorithms (partial least squares discrimination analysis; random forest, RF). RF models constructed by high-level data fusion with different modes of different spectral data (Fourier transform near-infrared spectrum and attenuated total reflection Fourier transform mid-infrared spectrum) were most suitable for identifying EULs from different geographical origins. The accuracy rates of calibration and validation set were 92.86% and 93.44%, respectively. In addition, climate parameters were systematically investigated the cluster difference in our study. Some interesting and novel information could be found from the clustering tree diagram of hierarchical cluster analysis. The Xinjiang Autonomous Region (Region 5) located in the high latitude area was the only region in the middle temperate zone of all sample collection areas in which the samples belonged to an individual class no matter their distance in the tree diagram. The samples were from a relatively high elevation in the Shennongjia Forest District in Hubei Province (>1200 m), which is the main difference from the samples from Xiangyang City (78 m). Thus, the sample clusters from region 9 are different from the sample clusters from other regions. The results would provide a reference for further research to those samples from the special cluster.
Zuo Z, Wang Y, Yao Z ACS Omega. 2025; 10(7):7242-7255.
PMID: 40028126 PMC: 11865977. DOI: 10.1021/acsomega.4c10816.
Chen M, Quan Z, Sun X, Li Y, Qian L, Zhang D Foods. 2025; 14(1.
PMID: 39796378 PMC: 11719895. DOI: 10.3390/foods14010089.
Recent trends of machine learning applied to multi-source data of medicinal plants.
Zhang Y, Wang Y J Pharm Anal. 2024; 13(12):1388-1407.
PMID: 38223450 PMC: 10785154. DOI: 10.1016/j.jpha.2023.07.012.
Application of Data Fusion in Traditional Chinese Medicine: A Review.
Huang R, Ma S, Dai S, Zheng J Sensors (Basel). 2024; 24(1).
PMID: 38202967 PMC: 10781265. DOI: 10.3390/s24010106.
Zhang Y, Wang Y Food Chem X. 2023; 19:100860.
PMID: 37780348 PMC: 10534232. DOI: 10.1016/j.fochx.2023.100860.