Skip to main content
U.S. flag

An official website of the United States government

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

Skip to content

Data for: Ignoring species availability biases occupancy estimates in single-scale occupancy models

Metadata Updated: July 6, 2024

We simulate over 28,000 datasets and saved their model outputs to answer the following three questions: (1) what is an adequate sampling design for the multi-scale occupancy model when there are a priori expectations of parameter estimates?, (2) what is an adequate sampling design when we have no expectations of parameter estimates?, and (3) what is the cost (in terms of bias, accuracy, precision and coverage) in occupancy estimates) if availability is not accounted for?

Specifically, we simulated data under four scenarios: Scenario 1 (n = 10,000): Species availability is constant across sites (but less than one), Scenario 2 (n = 9,358): Species availability is heterogenous across sites, Scenario 3 (n = 2,815): Species availability is heterogenous across years, and Scenario 4 (n = 5,942): Species availability is correlated to their detection probability.

Then, for each scenario except the first, we analyzed the data using four different estimators: (i) constant multi-scale occupancy model, (ii) multi-scale occupancy model with a random-effects term in the availability part of the model, (iii) constant single-scale occupancy model, and (iv) single-scale occupancy model with a random-effects term in the detection part of the model. Note the formulation of the random-effects terms included in the models mimicked the way that data were simulated (e.g., if species availability was heterogenous across sites, then a site random-effects term was included in the models). The first scenario was analyzed using models (i) and (iii) only. For simplicity, we refer to models (i) and (iii) as ‘constant’ models or 'fixed-effects' models. We refer to models (ii) and (iv) as ‘random-effects’ models.

The summary of simulated data and model estimates are located in four folders, each corresponding to a different simulated scenario: Scenario 1 (n = 10,000): Folder ModelOutput_Scen1_TwolevelSim = csv files holding data are named Results_TwoLevelAvail_2lev_x.csv Scenario 2 (n = 9,358): Folder ModelOutput_Scen2_HeteroSite = csv files holding data are named Results_TwoLevelAvail_Hetero_x.csv Scenario 3 (n = 2,815): Folder ModelOutput_Scen3_HeteroYear = csv files holding data are named Results_TwoLevelAvail_HeteroSeason_x.csv Scenario 4 (n = 5,942): Folder ModelOutput_Scen4_Cor = csv files holding data are named Results_TwoLevelAvail_Cor_x.csv

Each row in each of the csv files contains information related to a different simulated dataset and includes information related to: sampling design, true parameter values, and model estimates. Other files in the folder correspond to the entire model output (.rda files), time for model run to complete (time_..csv), and a file indicating whether or not the model run finished (nsim...csv). For more information related to those files, we point the user to the code that generated them: Scenario 1 (n = 10,000): Scen1_Constant.R Scenario 2 (n = 9,358): Scen2_HeteroSite.R Scenario 3 (n = 2,815): Scen3_HeteroYear.R Scenario 4 (n = 5,942): Scen4_Corr.R

Access & Use Information

Public: This dataset is intended for public access and use. License: No license information was provided. If this work was prepared by an officer or employee of the United States government as part of that person's official duties it is considered a U.S. Government Work.

Downloads & Resources

Dates

Metadata Created Date June 1, 2023
Metadata Updated Date July 6, 2024

Metadata Source

Harvested from DOI EDI

Additional Metadata

Resource Type Dataset
Metadata Created Date June 1, 2023
Metadata Updated Date July 6, 2024
Publisher U.S. Geological Survey
Maintainer
@Id http://datainventory.doi.gov/id/dataset/2b135070a7d2e6d4b751b68e4bbed9bc
Identifier USGS:91f83b3b-f11b-482e-bd9c-f176dbd37078
Data Last Modified 20210616
Category geospatial
Public Access Level public
Bureau Code 010:12
Metadata Context https://project-open-data.cio.gov/v1.1/schema/catalog.jsonld
Metadata Catalog ID https://datainventory.doi.gov/data.json
Schema Version https://project-open-data.cio.gov/v1.1/schema
Catalog Describedby https://project-open-data.cio.gov/v1.1/schema/catalog.json
Harvest Object Id db5e27dd-9ad3-4f58-af6a-50c1df19ea36
Harvest Source Id 52bfcc16-6e15-478f-809a-b1bc76f1aeda
Harvest Source Title DOI EDI
Metadata Type geospatial
Old Spatial -72.66357421875001,42.30169032824452,-72.62649536132814,42.32707774458643
Publisher Hierarchy White House > U.S. Department of the Interior > U.S. Geological Survey
Source Datajson Identifier True
Source Hash c02707333a25d03ca61a1c6e30e6edca46a5d3a4c656ca650cf670ea95193d29
Source Schema Version 1.1
Spatial {"type": "Polygon", "coordinates": -72.66357421875001, 42.30169032824452, -72.66357421875001, 42.32707774458643, -72.62649536132814, 42.32707774458643, -72.62649536132814, 42.30169032824452, -72.66357421875001, 42.30169032824452}

Didn't find what you're looking for? Suggest a dataset here.