» Articles » PMID: 28808561

Quantifying the Degree of Bias from Using County-scale Data in Species Distribution Modeling: Can Increasing Sample Size or Using County-averaged Environmental Data Reduce Distributional Overprediction?

Overview
Journal Ecol Evol
Date 2017 Aug 16
PMID 28808561
Citations 4
Authors
Affiliations
Soon will be listed here.
Abstract

Citizen-science databases have been used to develop species distribution models (SDMs), although many taxa may be only georeferenced to county. It is tacitly assumed that SDMs built from county-scale data should be less precise than those built with more accurate localities, but the extent of the bias is currently unknown. Our aims in this study were to illustrate the effects of using county-scale data on the spatial extent and accuracy of SDMs relative to true locality data and to compare potential compensatory methods (including increased sample size and using overall county environmental averages rather than point locality environmental data). To do so, we developed SDMs in maxent with PRISM-derived BIOCLIM parameters for 283 and 230 species of odonates (dragonflies and damselflies) and butterflies, respectively, for five subsets from the OdonataCentral and Butterflies and Moths of North America citizen-science databases: (1) a true locality dataset, (2) a corresponding sister dataset of county-centroid coordinates, (3) a dataset where the average environmental conditions within each county were assigned to each record, (4) a 50/50% mix of true localities and county-centroid coordinates, and (5) a 50/50% mix of true localities and records assigned the average environmental conditions within each county. These mixtures allowed us to quantify the degree of bias from county-scale data. Models developed with county centroids overpredicted the extent of suitable habitat by 15% on average compared to true locality models, although larger sample sizes (>100 locality records) reduced this disparity. Assigning county-averaged environmental conditions did not offer consistent improvement, however. Because county-level data are of limited value for developing SDMs except for species that are widespread and well collected or that inhabit regions where small, climatically uniform counties predominate, three means of encouraging more accurate georeferencing in citizen-science databases are provided.

Citing Articles

Human risk to tick encounters in the southeastern United States estimated with spatial distribution modeling.

Butler R, Papes M, Vogt J, Paulsen D, Crowe C, Trout Fryxell R PLoS Negl Trop Dis. 2024; 18(2):e0011919.

PMID: 38354196 PMC: 10898775. DOI: 10.1371/journal.pntd.0011919.


Investigating the potential of social media and citizen science data to track changes in species' distributions.

ONeill D, Hakkinen H, Neumann J, Shaffrey L, Cheffings C, Norris K Ecol Evol. 2023; 13(5):e10063.

PMID: 37168983 PMC: 10166650. DOI: 10.1002/ece3.10063.


Trends and gaps in the use of citizen science derived data as input for species distribution models: A quantitative review.

Feldman M, Imbeau L, Marchand P, Mazerolle M, Darveau M, Fenton N PLoS One. 2021; 16(3):e0234587.

PMID: 33705414 PMC: 7951830. DOI: 10.1371/journal.pone.0234587.


Quantifying the degree of bias from using county-scale data in species distribution modeling: Can increasing sample size or using county-averaged environmental data reduce distributional overprediction?.

Collins S, Abbott J, McIntyre N Ecol Evol. 2017; 7(15):6012-6022.

PMID: 28808561 PMC: 5551104. DOI: 10.1002/ece3.3115.

References
1.
Warren D, Seifert S . Ecological niche modeling in Maxent: the importance of model complexity and the performance of model selection criteria. Ecol Appl. 2011; 21(2):335-42. DOI: 10.1890/10-1171.1. View

2.
Collins S, Abbott J, McIntyre N . Quantifying the degree of bias from using county-scale data in species distribution modeling: Can increasing sample size or using county-averaged environmental data reduce distributional overprediction?. Ecol Evol. 2017; 7(15):6012-6022. PMC: 5551104. DOI: 10.1002/ece3.3115. View

3.
Wood C, Sullivan B, Iliff M, Fink D, Kelling S . eBird: engaging birders in science and conservation. PLoS Biol. 2011; 9(12):e1001220. PMC: 3243722. DOI: 10.1371/journal.pbio.1001220. View

4.
Warren D, Glor R, Turelli M . Environmental niche equivalency versus conservatism: quantitative approaches to niche evolution. Evolution. 2008; 62(11):2868-83. DOI: 10.1111/j.1558-5646.2008.00482.x. View

5.
Fithian W, Elith J, Hastie T, Keith D . Bias correction in species distribution models: pooling survey and collection data for multiple species. Methods Ecol Evol. 2016; 6(4):424-438. PMC: 5102514. DOI: 10.1111/2041-210X.12242. View