Skip to main content
U.S. flag

An official website of the United States government

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

Skip to content

Select elements of concern in surface water of three hydrologic basins (Delaware River, Illinois River and Upper Colorado River) - Data screening for the development of spatial and temporal models

Metadata Updated: September 12, 2025

This data release is focused on the analysis of surface water concentration data associated with 12 elements of concern from three hydrologic basins. Data is analyzed with respect to: a) reporting limits, b) the extent of censored data, c) co-location with USGS real-time sensor data, and d) median concentrations at the catchment spatial scale. The Proxies Project (under the Water Quality Program of the USGS Water Mission Area) is a multi-year effort designed to develop rapid and cost-effective approaches for monitoring and risk assessment of a range of aquatic contaminants in riverine surface waters at multiple spatial scales. One component of this project is focused on 12 Elements of Concern (EoC; Al, As, Cd, Cr, Cu, Fe, Hg, Mn, Pb, Se, U and Zn) in three primary hydrologic basins: Delaware River Basin (DRB), the Illinois River Basin (ILRB) and the Upper Colorado (UCOL) River Basin (USGS, 2023). Two modeling approaches being explored as part of the Proxies Project rely on the analysis of previously published EoC concentration data retrieved from the multi-agency supported Water Quality Portal (www.waterqualitydata.us/). This basin-specific retrieved data, covering the 1900-2022 timeframe, was subsequently screened, harmonized and published as part of an earlier USGS Data Release (Marvin-DiPasquale and others, 2022).
The two distinct modeling approaches that leverage this previously published data are: a) machine learning statistical analysis of EoC concentration distributions as a function of geospatial attributes; and b) time series analysis in support of estimating EoC concentrations in (near)real-time at a sub-set of USGS real-time stations using discharge in combination with a range of deployed in-situ sensors.
Prior to the final stages of model development, there were several data analysis steps required to further define which elements and aquatic fractions (i.e. filtered, unfiltered, and particulate) best lend themselves to further model exploration and development. These intermediate data analyses include: a) an analysis of the change in detection quantitation limits, by element and methods over time (DR_Table _1); b) an analysis of data censoring, by study basin, element, and fraction (DR_Table_2); c) a calculation of median EoC concentrations at the National Hydrography Dataset Plus (NHDPlus) catchment spatial scale (DR_Table_3); d) an analysis of the percentage of censored median EoC concentration values by study basin, element, and fraction (DR_Table_4); e) decision tree analysis associated with the geospatial machine learning modeling approach, by study basin, element and fraction (DR_Table_5); f) discrete EoC concentration data merged with continuous discharge and in-situ sensor data at USGS real-time stations, by station ID, element and fraction (DR_Table_6); and g) an analysis of the total number of observations and the percentage of censored EoC data associated with the merged discrete EoC and continuous discharge and sensor data retrieved from USGS real-time stations, by station ID, element, and fraction (DR_Table_7). The current data release documents the results of these data analyses. The associated seven data tables presented herein are provided in machine-readable comma separated value (*.csv) format and are more fully described in the associated meta-data. REFERENCES Marvin-DiPasquale, M.C., Sullivan, S.L., Platt, L. R., Gorsky, A., Agee, J.L., McCleskey, B.R., Kakouros, E., Walton-Day, K., Runkel, R. L., Morriss, M. C., Wakefield, B. F., and Bergamaschi, B., 2022, Concentration Data for 12 Elements of Concern Used in the Development of Surrogate Models for Estimating Elemental Concentrations in Surface Water of Three Hydrologic Basins (Delaware River, Illinois River and Upper Colorado River): U.S. Geological Survey data release, https://doi.org/10.5066/P9L06M3G. USGS, 2023, Proxies Project, U.S. Geological Survey webpage, accessed 3/11/2025, https://www.usgs.gov/mission-areas/water-resources/science/proxies-project

Access & Use Information

Public: This dataset is intended for public access and use. License: No license information was provided. If this work was prepared by an officer or employee of the United States government as part of that person's official duties it is considered a U.S. Government Work.

Downloads & Resources

Dates

Metadata Created Date September 12, 2025
Metadata Updated Date September 12, 2025

Metadata Source

Harvested from DOI USGS DCAT-US

Additional Metadata

Resource Type Dataset
Metadata Created Date September 12, 2025
Metadata Updated Date September 12, 2025
Publisher U.S. Geological Survey
Maintainer
Identifier http://datainventory.doi.gov/id/dataset/usgs-64e8ec5bd34ec376dc879b9b
Data Last Modified 2025-08-07T00:00:00Z
Category geospatial
Public Access Level public
Bureau Code 010:12
Metadata Context https://project-open-data.cio.gov/v1.1/schema/catalog.jsonld
Metadata Catalog ID https://ddi.doi.gov/usgs-data.json
Schema Version https://project-open-data.cio.gov/v1.1/schema
Catalog Describedby https://project-open-data.cio.gov/v1.1/schema/catalog.json
Harvest Object Id 91d8460e-cfa3-4907-add2-e5529d93898c
Harvest Source Id 2b80d118-ab3a-48ba-bd93-996bbacefac2
Harvest Source Title DOI USGS DCAT-US
Metadata Type geospatial
Old Spatial -109.0271, 36.8673, -73.9961, 43.1283
Source Datajson Identifier True
Source Hash d8654d37b332403af0ec2b7673a797f01f06fc567318432b40a94c74dff0041e
Source Schema Version 1.1
Spatial {"type": "Polygon", "coordinates": -109.0271, 36.8673, -109.0271, 43.1283, -73.9961, 43.1283, -73.9961, 36.8673, -109.0271, 36.8673}

Didn't find what you're looking for? Suggest a dataset here.