» Articles » PMID: 29271909

Comparison of Random Forest, K-Nearest Neighbor, and Support Vector Machine Classifiers for Land Cover Classification Using Sentinel-2 Imagery

Overview
Journal Sensors (Basel)
Publisher MDPI
Specialty Biotechnology
Date 2017 Dec 23
PMID 29271909
Citations 59
Authors
Affiliations
Soon will be listed here.
Abstract

In previous classification studies, three non-parametric classifiers, Random Forest (RF), k-Nearest Neighbor (kNN), and Support Vector Machine (SVM), were reported as the foremost classifiers at producing high accuracies. However, only a few studies have compared the performances of these classifiers with different training sample sizes for the same remote sensing images, particularly the Sentinel-2 Multispectral Imager (MSI). In this study, we examined and compared the performances of the RF, kNN, and SVM classifiers for land use/cover classification using Sentinel-2 image data. An area of 30 × 30 km² within the Red River Delta of Vietnam with six land use/cover types was classified using 14 different training sample sizes, including balanced and imbalanced, from 50 to over 1250 pixels/class. All classification results showed a high overall accuracy (OA) ranging from 90% to 95%. Among the three classifiers and 14 sub-datasets, SVM produced the highest OA with the least sensitivity to the training sample sizes, followed consecutively by RF and kNN. In relation to the sample size, all three classifiers showed a similar and high OA (over 93.85%) when the training sample size was large enough, i.e., greater than 750 pixels/class or representing an area of approximately 0.25% of the total study area. The high accuracy was achieved with both imbalanced and balanced datasets.

Citing Articles

A machine learning approach to predict mortality and neonatal persistent pulmonary hypertension in newborns with congenital diaphragmatic hernia. A retrospective observational cohort study.

Conte L, Amodeo I, De Nunzio G, Raffaeli G, Borzani I, Persico N Eur J Pediatr. 2025; 184(4):238.

PMID: 40067512 PMC: 11897082. DOI: 10.1007/s00431-025-06073-0.


MIML: multiplex image machine learning for high precision cell classification via mechanical traits within microfluidic systems.

Islam K, Paul R, Wang S, Zhao Y, Adhikary P, Li Q Microsyst Nanoeng. 2025; 11(1):43.

PMID: 40050640 PMC: 11885814. DOI: 10.1038/s41378-025-00874-x.


Stacked encoding and AutoML-based identification of lead-zinc small open pit active mines around Rampura Agucha in Rajasthan state, India.

Ojha A, Biswas R, Krishna A Sci Rep. 2025; 15(1):5766.

PMID: 39962260 PMC: 11832769. DOI: 10.1038/s41598-025-89672-z.


Predicting software reuse using machine learning techniques-A case study on open-source Java software systems.

Yeow M, Chong C, Lim M, Yee Yen Y PLoS One. 2025; 20(2):e0314512.

PMID: 39946354 PMC: 11824963. DOI: 10.1371/journal.pone.0314512.


Road urban planning sustainability based on remote sensing and satellite dataset: A review.

Mhana K, Norhisham S, Katman H, Yaseen Z Heliyon. 2024; 10(21):e39567.

PMID: 39524728 PMC: 11550651. DOI: 10.1016/j.heliyon.2024.e39567.


References
1.
Foley J, DeFries R, Asner G, Barford C, Bonan G, Carpenter S . Global consequences of land use. Science. 2005; 309(5734):570-4. DOI: 10.1126/science.1111772. View

2.
Gao Q, Zribi M, Escorihuela M, Baghdadi N . Synergetic Use of Sentinel-1 and Sentinel-2 Data for Soil Moisture Mapping at 100 m Resolution. Sensors (Basel). 2017; 17(9). PMC: 5621168. DOI: 10.3390/s17091966. View