Skip to main content
U.S. flag

An official website of the United States government

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

Skip to content

Statistical predictions of groundwater levels and related spatial diagnostics for the Mississippi River Valley alluvial aquifer from the mmlMRVAgen1 statistical machine-learning software, GeoTIFF formatted

Metadata Updated: November 27, 2025

A multiple machine-learning model (Asquith and Killian, 2024) implementing Cubist and Random Forest regressions was used to predict monthly mean groundwater levels through time for the available years described in the metadata for the Mississippi River Valley alluvial aquifer (MRVA). The MRVA is the surficial aquifer of the Mississippi Alluvial Plain (MAP), located in the south-central United States. Employing two machine-learning techniques offered the opportunity to generate model and statistical error and covariance between them to estimate total uncertainty. Potentiometric surface predictions were made at the 1-kilometer grid scale using the National Hydrogeologic Grid (Clark and others, 2018). Results produced by the mmlMRVAgen1 software have been condensed into thirteen themes each containing a multi-banded GeoTIFF raster with 504 layers, corresponding to each month for the available years described in the metadata. The themes include the final predicted monthly water-level altitudes, in feet North American Vertical Datum of 1988 (NAVD88), for the study area (pol), which were computed by pooling through weighted-mean averaging by cell the even and odd year predictions for that month. The depth to water was predicted in feet (nhgd2w), utilizing the NHG cell altitude as the land surface datum. Model errors were evaluated using both the normal error (modnorerr) in standard deviations of feet and the polynomial-density-quantile4 distribution (PDQ4)-error model (modpdqerr) without the inclusion of land-surface variation of the NHG. The equivalent standard deviations of these error models were calculated both with and without the inclusion of land-surface variation of the NHG (norerr, norerrnhg, pdqerr, pdqerrnhg). The lower and upper bounds (in feet) of the 90-percent prediction limits for both model error forms were computed (norlwr, norupr, pdqlwr, pdqupr). Lastly, the ratio of model error to total error (modtotrat) was also computed. Complementing each of the GeoTIFFs are .json extensions to each file. These provide additional multi-band support information. This double-file representation stems from the native GeoTIFF drivers within the terra R package underpinning the operations. Overall, the model objects created by the mmlMRVAgen1 from about 156,000 water-level records for about 58,000 wells report (1) a normalized Nash−Sutcliffe Efficiency (NNSE) of about 0.997, (2) a root-mean-square error (RMSE) of about 4.15 feet, and (3) a bias prior to computing the NNSE and RMSE of about 0.0963 feet before its subsequent removal (see mmlMRVAgen1 software diagnostics associated with "MRVA_MML_CONSTANTS"). The model objects also report for the 156,000 water-level records (1) a mean percent ratio of model error to total error of about 69.2 percent and (2) a mean width of about 12.05 feet for the 90-percent prediction bounds from the PDQ4 error framework (see mmlMRVAgen1 software diagnostics associated with "genMML/03step.R"). The model objects were used in post-model creation to predict each of the rasters provided in this data release. (Note, the results herein are associated with the "April 21, 2024" model run, see mmlMRVAgen1/model_archive/README.md.) For a full description of covariate assemblage and hydrograph modeling, see Asquith and Killian (2022) (covMRVAgen1 software). For a full description of multiple machine-learning modeling, see Asquith and Killian (2024) (mmlMRVAgen1 software).

Access & Use Information

Public: This dataset is intended for public access and use. License: No license information was provided. If this work was prepared by an officer or employee of the United States government as part of that person's official duties it is considered a U.S. Government Work.

Downloads & Resources

Dates

Metadata Created Date September 13, 2025
Metadata Updated Date November 27, 2025

Metadata Source

Harvested from DOI USGS DCAT-US

Additional Metadata

Resource Type Dataset
Metadata Created Date September 13, 2025
Metadata Updated Date November 27, 2025
Publisher U.S. Geological Survey
Maintainer
Identifier http://datainventory.doi.gov/id/dataset/usgs-65f30833d34e9853bbf0cb2b
Data Last Modified 2024-08-30T00:00:00Z
Category geospatial
Public Access Level public
Bureau Code 010:12
Metadata Context https://project-open-data.cio.gov/v1.1/schema/catalog.jsonld
Metadata Catalog ID https://ddi.doi.gov/usgs-data.json
Schema Version https://project-open-data.cio.gov/v1.1/schema
Catalog Describedby https://project-open-data.cio.gov/v1.1/schema/catalog.json
Harvest Object Id c99caf8c-fd61-40b9-a34c-4f37584feacb
Harvest Source Id 2b80d118-ab3a-48ba-bd93-996bbacefac2
Harvest Source Title DOI USGS DCAT-US
Metadata Type geospatial
Source Datajson Identifier True
Source Hash f30e3f5db512af5080e22f39b71464ea23ab4d7bd833113dadb2f15eb532da40
Source Schema Version 1.1

Didn't find what you're looking for? Suggest a dataset here.