Skip to main content
U.S. flag

An official website of the United States government

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

Skip to content

Two-stage models improve machine learning classifiers in wildlife research: A case study in identifying false positive detections of Ruffed Grouse

Metadata Updated: September 12, 2025

Autonomous recording units are increasingly being used to monitor wildlife on large geographic and temporal scales, paired with machine learning (ML) to automate detection of wildlife. However, false positive detections from ML classifiers can result in erroneous ecological models that can lead to misguided management and conservation actions. We used a two-stage general approach to understand and reduce false positive detections, a technique in which outputs of the primary classification model are passed to a secondary classification model to yield the probability that a detection from the primary model is a true positive detection. This approach is demonstrated on two open-source models, BirdNET and the Drumming Model, that detect Ruffed Grouse (Bonasa umbellus). We analyzed over 9500 hours of acoustic data collected in 2022-2023 from the Green Mountain National Forest in Vermont, USA, and found the two models detected different types of acoustic signals associated with differing life history traits. The Drumming Model yielded 4106 detections (71.5% true positives); BirdNET yielded 524 detections (17.0% true positives). Secondary logistic regression models separated true positives and false positives with high accuracy (BirdNET = 84.5%; Drumming Model = 89.8%). Our findings go beyond improving Ruffed Grouse monitoring and conservation efforts to, more broadly, illustrate how two-stage ML approaches can improve the use of model-derived detections in wildlife research.

Access & Use Information

Public: This dataset is intended for public access and use. License: No license information was provided. If this work was prepared by an officer or employee of the United States government as part of that person's official duties it is considered a U.S. Government Work.

Downloads & Resources

Dates

Metadata Created Date September 12, 2025
Metadata Updated Date September 12, 2025

Metadata Source

Harvested from DOI USGS DCAT-US

Additional Metadata

Resource Type Dataset
Metadata Created Date September 12, 2025
Metadata Updated Date September 12, 2025
Publisher U.S. Geological Survey
Maintainer
Identifier http://datainventory.doi.gov/id/dataset/usgs-679392d5d34e88f5864c50b5
Data Last Modified 2025-04-30T00:00:00Z
Category geospatial
Public Access Level public
Bureau Code 010:12
Metadata Context https://project-open-data.cio.gov/v1.1/schema/catalog.jsonld
Metadata Catalog ID https://ddi.doi.gov/usgs-data.json
Schema Version https://project-open-data.cio.gov/v1.1/schema
Catalog Describedby https://project-open-data.cio.gov/v1.1/schema/catalog.json
Harvest Object Id e298a306-768f-41e1-a69c-a7e1b824e1a1
Harvest Source Id 2b80d118-ab3a-48ba-bd93-996bbacefac2
Harvest Source Title DOI USGS DCAT-US
Metadata Type geospatial
Old Spatial -73.2300, 42.7300, -72.7400, 44.1600
Source Datajson Identifier True
Source Hash e4a3319955c2791805943f6794510c8e567ad282b3eec54e785789f062039355
Source Schema Version 1.1
Spatial {"type": "Polygon", "coordinates": -73.2300, 42.7300, -73.2300, 44.1600, -72.7400, 44.1600, -72.7400, 42.7300, -73.2300, 42.7300}

Didn't find what you're looking for? Suggest a dataset here.