Cyanobacteria Aggregated Manual Labels

Metadata Updated: September 19, 2025

Continuous monitoring of cyanobacteria blooms in small, inland water bodies through in-situ sampling and analysis is challenging, both because of the number and locations of water bodies to cover and because algal growth and toxin production are dynamic. Detection targets vary with cyanobacteria strain as well as with physical, chemical, and biological factors. Ground monitoring also lacks consistency, since sampling methods, frequency, and analytical techniques vary from region to region. Remote sensing, by contrast, allows systematic data collection over a large area to identify regions with potentially harmful algal growth. We introduce the Cyanobacteria Aggregated Manual Labels (CAML), a large dataset of in-situ cyanobacteria measurements for investigating cyanobacteria detection and severity classification in inland water bodies across the United States. Satellite imagery from publicly available endpoints can be paired with the CAML labels when training models. The dataset provides ground measurements of cyanobacteria cell counts at 23,570 points in U.S. inland water bodies from 2013 to 2021. Algorithms trained on this data could be used to estimate cyanobacteria cell counts in water bodies for timely water quality and public health interventions, and to better understand the environmental and anthropogenic factors associated with cyanobacteria incidence and proliferation. Data is provided in comma-separated values (CSV) format.
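As a rough illustration of the severity classification use case described above, the sketch below reads CSV rows of cell counts and assigns each a coarse severity label. Everything specific in it is an assumption made for illustration: the column names (latitude, longitude, date, cells_per_ml), the sample rows, and the threshold values are hypothetical and are not taken from the CAML files. Check the actual CSV header and choose thresholds appropriate to your application before adapting it.

```python
import csv
import io

# Hypothetical sample data. The column names and values are illustrative
# only; the real CAML CSV header and contents may differ.
SAMPLE_CSV = """latitude,longitude,date,cells_per_ml
35.10,-78.90,2015-07-14,5000
41.42,-81.70,2018-08-02,45000
28.55,-81.35,2020-06-21,250000
"""

# Illustrative (upper bound, label) pairs in cells/mL, in ascending order.
# These cut-offs are an assumption, not part of the dataset description.
THRESHOLDS = ((20_000, "low"), (100_000, "moderate"))


def severity(cells_per_ml, thresholds=THRESHOLDS, top_label="high"):
    """Return the label of the first band whose upper bound exceeds the count."""
    for upper_bound, label in thresholds:
        if cells_per_ml < upper_bound:
            return label
    return top_label


def load_labels(csv_text):
    """Parse CSV text and attach a severity label to every row."""
    rows = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        count = float(row["cells_per_ml"])
        rows.append(
            {
                "latitude": float(row["latitude"]),
                "longitude": float(row["longitude"]),
                "date": row["date"],
                "cells_per_ml": count,
                "severity": severity(count),
            }
        )
    return rows


labels = load_labels(SAMPLE_CSV)
for record in labels:
    print(record["date"], record["cells_per_ml"], record["severity"])
```

Keeping the thresholds as a parameter rather than hard-coding them makes it straightforward to swap in whatever severity bands a given study or agency guideline uses.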

Access & Use Information

Public: This dataset is intended for public access and use.
License: No license information was provided. If this work was prepared by an officer or employee of the United States government as part of that person's official duties, it is considered a U.S. Government Work.

Dates

Metadata Created Date April 9, 2025
Metadata Updated Date September 19, 2025

Metadata Source

Harvested from NASA Data.json

Additional Metadata

Resource Type Dataset
Publisher NASA/GSFC/SED/ESD/GCDC/OB.DAAC;NASA/GSFC/SED/ESD/GCDC/SeaBASS
Maintainer
Identifier 10.5067/SeaBASS/CAML/DATA001
Data Last Modified 2025-09-11
Category Earth Science
Public Access Level public
Bureau Code 026:00
Metadata Context https://project-open-data.cio.gov/v1.1/schema/catalog.jsonld
Schema Version https://project-open-data.cio.gov/v1.1/schema
Catalog Describedby https://project-open-data.cio.gov/v1.1/schema/catalog.json
Harvest Object Id 441b1755-5841-4b2a-a5c5-3f15cd2ab171
Harvest Source Id 58f92550-7a01-4f00-b1b2-8dc953bd598f
Harvest Source Title NASA Data.json
Homepage URL https://seabass.gsfc.nasa.gov/experiment/CAML/
Old Spatial {"EastBoundingCoordinate":180.0,"NorthBoundingCoordinate":90.0,"SouthBoundingCoordinate":-90.0,"WestBoundingCoordinate":-180.0},"CARTESIAN"
Program Code 026:000
Source Datajson Identifier True
Source Hash 69f4570365f2232903e34d9021772b6296ebe9ba820e8d5a240474aba393209b
Source Schema Version 1.1
Spatial
Temporal 2013-01-04/2013-01-04
