Skip to main content
U.S. flag

An official website of the United States government

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

Skip to content

Data and model code in support of Stream nitrate dynamics driven primarily by discharge and watershed physical and soil characteristics at intensively monitored sites, Insights from deep learning

Metadata Updated: August 25, 2024

We developed a suite of models using deep learning to make hindcast predictions of the 7-day average backward-looking nitrate concentration at 46 predominantly agricultural sites across the midwestern and eastern United States. The models used daily observations of discharge and meteorological variables and static watershed attributes describing anthropogenic modification to hydrology, nitrogen application, climate, groundwater, land use and land cover, watershed physical attributes, and soils. Across all sites, discharge and watershed soil and physiographic attributes show a particularly strong influence on model performance. An analysis of drivers across sites revealed considerable regional differences related to controlling processes such as groundwater contributions. We tested several ways to pool data across sites to develop accurate models and make the most effective use of available data. Single-site models, in which models are trained and tested at a single location, showed generally strong predictive performance (median Kling-Gupta Efficiency = 0.66), and accuracy at poorly performing sites could be improved by grouping sites with similar characteristics. Developing a single model for all sites reduced performance at several locations with distinct characteristics, suggesting that there is a threshold of dissimilarity beyond which more data does not improve the model. While many deep learning studies have shown that national or even global models can outperform local models, it is not clear that this is true for water quality constituents. This study demonstrates how existing data can be combined effectively, using deep learning to develop accurate and interpretable models of instream nitrate at sites where varying processes are responsible for changes in nitrate concentration. This release provides code and data for running a suite of machine learning model to predict in stream nitrate concentration and using explainable AI to analyze model outputs and compare among modeling approaches.

Access & Use Information

Public: This dataset is intended for public access and use. License: No license information was provided. If this work was prepared by an officer or employee of the United States government as part of that person's official duties it is considered a U.S. Government Work.

Downloads & Resources

Dates

Metadata Created Date August 25, 2024
Metadata Updated Date August 25, 2024

Metadata Source

Harvested from DOI EDI

Additional Metadata

Resource Type Dataset
Metadata Created Date August 25, 2024
Metadata Updated Date August 25, 2024
Publisher U.S. Geological Survey
Maintainer
@Id http://datainventory.doi.gov/id/dataset/101847505cd7b82fa9559df4fea0c4a3
Identifier USGS:661436d1d34e633466530330
Data Last Modified 20240823
Category geospatial
Public Access Level public
Bureau Code 010:12
Metadata Context https://project-open-data.cio.gov/v1.1/schema/catalog.jsonld
Metadata Catalog ID https://datainventory.doi.gov/data.json
Schema Version https://project-open-data.cio.gov/v1.1/schema
Catalog Describedby https://project-open-data.cio.gov/v1.1/schema/catalog.json
Harvest Object Id 1611894f-4ff5-414c-a970-d7cd712f6fee
Harvest Source Id 52bfcc16-6e15-478f-809a-b1bc76f1aeda
Harvest Source Title DOI EDI
Metadata Type geospatial
Old Spatial -96.75,38.32,-74.22,44.53
Publisher Hierarchy White House > U.S. Department of the Interior > U.S. Geological Survey
Source Datajson Identifier True
Source Hash b42ad0f9a648e76563668f2f0e0796cd72dc1f963a54c2508a1356354e4ee028
Source Schema Version 1.1
Spatial {"type": "Polygon", "coordinates": -96.75, 38.32, -96.75, 44.53, -74.22, 44.53, -74.22, 38.32, -96.75, 38.32}

Didn't find what you're looking for? Suggest a dataset here.