» Articles » PMID: 34266447

Prediction of Plant-level Tomato Biomass and Yield Using Machine Learning with Unmanned Aerial Vehicle Imagery

Overview
Journal Plant Methods
Publisher Biomed Central
Specialty Biology
Date 2021 Jul 16
PMID 34266447
Citations 9
Authors
Affiliations
Soon will be listed here.
Abstract

Background: The objective of this study is twofold. First, ascertain the important variables that predict tomato yields from plant height (PH) and vegetation index (VI) maps. The maps were derived from images taken by unmanned aerial vehicles (UAVs). Second, examine the accuracy of predictions of tomato fresh shoot masses (SM), fruit weights (FW), and the number of fruits (FN) from multiple machine learning algorithms using selected variable sets. To realize our objective, ultra-high-resolution RGB and multispectral images were collected by a UAV on ten days in 2020's tomato growing season. From these images, 756 total variables, including first- (e.g., average, standard deviation, skewness, range, and maximum) and second-order (e.g., gray-level co-occurrence matrix features and growth rates of PH and VIs) statistics for each plant, were extracted. Several selection algorithms (i.e., Boruta, DALEX, genetic algorithm, least absolute shrinkage and selection operator, and recursive feature elimination) were used to select the variable sets useful for predicting SM, FW, and FN. Random forests, ridge regressions, and support vector machines were used to predict the yield using the top five selected variable sets.

Results: First-order statistics of PH and VIs collected during the early to mid-fruit formation periods, about one month prior to harvest, were important variables for predicting SM. Similar to the case for SM, variables collected approximately one month prior to harvest were important for predicting FW and FN. Furthermore, variables related to PH were unimportant for prediction. Compared with predictions obtained using only first-order statistics, those obtained using the second-order statistics of VIs were more accurate for FW and FN. The prediction accuracy of SM, FW, and FN by models constructed from all variables (rRMSE = 8.8-28.1%) was better than that from first-order statistics (rRMSE = 10.0-50.1%).

Conclusions: In addition to basic statistics (e.g., average and standard deviation), we derived second-order statistics of PH and VIs at the plant level using the ultra-high resolution UAV images. Our findings indicated that our variable selection method reduced the number variables needed for tomato yield prediction, improving the efficiency of phenotypic data collection and assisting with the selection of high-yield lines within breeding programs.

Citing Articles

Data-driven analysis of climate impact on tomato and apple prices using machine learning.

Yoon S, Kim T, Kim D Heliyon. 2025; 11(1):e41478.

PMID: 39834407 PMC: 11742837. DOI: 10.1016/j.heliyon.2024.e41478.


Field-grown tomato yield estimation using point cloud segmentation with 3D shaping and RGB pictures from a field robot and digital single lens reflex cameras.

Ambrus B, Teschner G, Kovacs A, Nemenyi M, Helyes L, Pek Z Heliyon. 2024; 10(20):e37997.

PMID: 39640729 PMC: 11619985. DOI: 10.1016/j.heliyon.2024.e37997.


Precision estimation of winter wheat crop height and above-ground biomass using unmanned aerial vehicle imagery and oblique photoghraphy point cloud data.

Li Y, Li C, Cheng Q, Chen L, Li Z, Zhai W Front Plant Sci. 2024; 15:1437350.

PMID: 39359624 PMC: 11446220. DOI: 10.3389/fpls.2024.1437350.


Bioinformatics Identification and Experimental Verification of Disulfidptosis-Related Genes in the Progression of Osteoarthritis.

Cao S, Wei Y, Yue Y, Wang D, Xiong A, Yang J Biomedicines. 2024; 12(8).

PMID: 39200304 PMC: 11351109. DOI: 10.3390/biomedicines12081840.


Combining features selection strategy and features fusion strategy for SPAD estimation of winter wheat based on UAV multispectral imagery.

Su X, Nian Y, Shaghaleh H, Hamad A, Yue H, Zhu Y Front Plant Sci. 2024; 15:1404238.

PMID: 38799101 PMC: 11116665. DOI: 10.3389/fpls.2024.1404238.


References
1.
Liakos K, Busato P, Moshou D, Pearson S, Bochtis D . Machine Learning in Agriculture: A Review. Sensors (Basel). 2018; 18(8). PMC: 6111295. DOI: 10.3390/s18082674. View

2.
de Vlaming R, Groenen P . The Current and Future Use of Ridge Regression for Prediction in Quantitative Genetics. Biomed Res Int. 2015; 2015:143712. PMC: 4529984. DOI: 10.1155/2015/143712. View

3.
Hassan M, Yang M, Rasheed A, Yang G, Reynolds M, Xia X . A rapid monitoring of NDVI across the wheat growth cycle for grain yield prediction using a multi-spectral UAV platform. Plant Sci. 2019; 282:95-103. DOI: 10.1016/j.plantsci.2018.10.022. View

4.
Wang X, Singh D, Marla S, Morris G, Poland J . Field-based high-throughput phenotyping of plant height in sorghum using different sensing technologies. Plant Methods. 2018; 14:53. PMC: 6031187. DOI: 10.1186/s13007-018-0324-5. View

5.
Tattaris M, Reynolds M, Chapman S . A Direct Comparison of Remote Sensing Approaches for High-Throughput Phenotyping in Plant Breeding. Front Plant Sci. 2016; 7:1131. PMC: 4971441. DOI: 10.3389/fpls.2016.01131. View