Skip to main content
U.S. flag

An official website of the United States government

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

Skip to content

Species Distribution Models for Pectis imberbis, a Rare Plant Species in Southeastern Arizona

Metadata Updated: September 26, 2024

Species distribution models (SDMs) can be an important tool in rare species conservation. Specifically, SDMs have been used to location previously unknown populations and identify sites for reintroduction or translocation. With these goals in mind, we applied SDM to a recently listed plant species, Pectis imberbis, which is found in the Madrean Archipelago region of southern Arizona, USA, and northern Mexico. We used presence-pseudoabsence data and applied 10 replicates of 5 modeling algorithms, generalized linear model (GLM), generalized additive model (GAM, generalized boosted model (GBM, aka boosted regression trees), random forests (RF), and classification tree analysis (CTA) to 4 different predictor datasets which were developed with correlation analysis, feature selection, and variable importance analysis. The resulting models were evaluated based on k-fold cross validation using 4 different metrics: Cohen’s kappa statistic (K), the area under the curve of the receiver operating characteristic curve (ROC), the True Skill Statistic (TSS), and the Boyce Index (BI). High performing models were then included in ensemble model building using the ensemble mean, ensemble median, and committee averaging methods. We applied optimized threshold values to transform continuous species presence probability rasters into binary presence/absence rasters. We also computed the coefficient of variation for the model components of each predictor dataset. Based on calibration metrics, coefficient of variation between component models, consistency across known populations, and data coverage, the best model from these analyses for this application is the ensemble committee averaging based on the ROC metric using the 5-predictor dataset with USGS geologic data and a threshold based on optimizing the TSS (USGS-5v_EMcaByROC_binTSS.tif). However, we present all 100 modeling outputs rasters (24 continuous rasters, 72 binary rasters, and 4 coefficient of variation rasters) in this data release.

Access & Use Information

Public: This dataset is intended for public access and use. License: No license information was provided. If this work was prepared by an officer or employee of the United States government as part of that person's official duties it is considered a U.S. Government Work.

Downloads & Resources

Dates

Metadata Created Date September 22, 2024
Metadata Updated Date September 26, 2024

Metadata Source

Harvested from DOI EDI

Additional Metadata

Resource Type Dataset
Metadata Created Date September 22, 2024
Metadata Updated Date September 26, 2024
Publisher U.S. Geological Survey
Maintainer
@Id http://datainventory.doi.gov/id/dataset/0e9fe712678f8560a7301f519062ff16
Identifier USGS:6675cc03d34e6f159fd10f00
Data Last Modified 20240923
Category geospatial
Public Access Level public
Bureau Code 010:12
Metadata Context https://project-open-data.cio.gov/v1.1/schema/catalog.jsonld
Metadata Catalog ID https://datainventory.doi.gov/data.json
Schema Version https://project-open-data.cio.gov/v1.1/schema
Catalog Describedby https://project-open-data.cio.gov/v1.1/schema/catalog.json
Harvest Object Id ea3450e5-02f4-4cd4-9d45-ec7a19c402b2
Harvest Source Id 52bfcc16-6e15-478f-809a-b1bc76f1aeda
Harvest Source Title DOI EDI
Metadata Type geospatial
Old Spatial -111.7485,31.3179,-109.0346,32.089
Publisher Hierarchy White House > U.S. Department of the Interior > U.S. Geological Survey
Source Datajson Identifier True
Source Hash b5933986e57b5a1107d294698a8c7e456ff15b178217ae4ac5a3e863a6125b9f
Source Schema Version 1.1
Spatial {"type": "Polygon", "coordinates": -111.7485, 31.3179, -111.7485, 32.089, -109.0346, 32.089, -109.0346, 31.3179, -111.7485, 31.3179}

Didn't find what you're looking for? Suggest a dataset here.