Model output and code for a long short-term memory model forecasting streamflow drought across the Conterminous United States by focusing on percentiles below 50 percent
Machine learning (ML) models were used to forecast streamflow drought from 1 to 13 weeks into the future at more than 3,000 streamgage locations across the conterminous United States. Long short-term memory [LSTM] neural network models were used to predict weekly streamflow percentiles with independent models for each forecast horizon. In this data release we specifically include an LSTM model where only percentiles below 50% (LSTM<50) are prioritized. Model performance was evaluated regionally and nationally for drought occurrence (the classification performance for a future date) and for drought onset and termination (performance identifying drought starts and ends).
This data release contains model outputs (model_outputs.zip) and model code (model_code.zip). The model_outputs.zip folder contains files with model performance data (performance_data.feather) and observed and predicted streamflow percentile time series data (timeseries_data.feather). The model_code.zip folder contains Jupyter notebooks with Python scripts for running the LSTM models.
Find Related Datasets
Search by Tags
Click any tag below to search for similar datasets
Complete Metadata
| @type | dcat:Dataset |
|---|---|
| accessLevel | public |
| bureauCode |
[
"010:12"
]
|
| contactPoint |
{
"fn": "John C Hammond",
"@type": "vcard:Contact",
"hasEmail": "mailto:jhammond@usgs.gov"
}
|
| description | Machine learning (ML) models were used to forecast streamflow drought from 1 to 13 weeks into the future at more than 3,000 streamgage locations across the conterminous United States. Long short-term memory [LSTM] neural network models were used to predict weekly streamflow percentiles with independent models for each forecast horizon. In this data release we specifically include an LSTM model where only percentiles below 50% (LSTM<50) are prioritized. Model performance was evaluated regionally and nationally for drought occurrence (the classification performance for a future date) and for drought onset and termination (performance identifying drought starts and ends). This data release contains model outputs (model_outputs.zip) and model code (model_code.zip). The model_outputs.zip folder contains files with model performance data (performance_data.feather) and observed and predicted streamflow percentile time series data (timeseries_data.feather). The model_code.zip folder contains Jupyter notebooks with Python scripts for running the LSTM models. |
| distribution |
[
{
"@type": "dcat:Distribution",
"title": "Digital Data",
"format": "XML",
"accessURL": "https://doi.org/10.5066/P13HFSYE",
"mediaType": "application/http",
"description": "Landing page for access to the data"
},
{
"@type": "dcat:Distribution",
"title": "Original Metadata",
"format": "XML",
"mediaType": "text/xml",
"description": "The metadata original format",
"downloadURL": "https://data.usgs.gov/datacatalog/metadata/USGS.695bf56fd4be02126e7fb5a6.xml"
}
]
|
| identifier | http://datainventory.doi.gov/id/dataset/USGS_695bf56fd4be02126e7fb5a6 |
| keyword |
[
"USGS:695bf56fd4be02126e7fb5a6",
"autoregressive integrated moving average",
"deep learning",
"drought",
"gradient boosting machine",
"hydrology",
"inlandWaters",
"long short-term memory",
"machine learning",
"modeling",
"neural network",
"streamflow",
"time series",
"water resources"
]
|
| modified | 2026-01-22T00:00:00Z |
| publisher |
{
"name": "U.S. Geological Survey",
"@type": "org:Organization"
}
|
| spatial | -124.7565, 24.5232, -66.9491, 49.3845 |
| theme |
[
"geospatial"
]
|
| title | Model output and code for a long short-term memory model forecasting streamflow drought across the Conterminous United States by focusing on percentiles below 50 percent |