Identifying Prevalent Chemical Mixtures in the US Population EHP Data

Metadata Updated: November 12, 2020

Frequent itemset mining (FIM), a technique used for finding patterns in consumer purchasing behavior, can be applied to data from large-scale biomonitoring studies to identify combinations of chemicals that frequently co-occur in people. As a proof of concept, we applied FIM to biomonitoring data from the National Health and Nutrition Examination Survey. In this way, we identified 90 chemical combinations consisting of relatively few chemicals that occur in at least 30% of the US population, as well as 3 super-combinations consisting of relatively many chemicals that occur in a small but non-negligible proportion of the US population. Thus, we have demonstrated a technique for narrowing a large number of possible chemical combinations down to a much smaller collection of prevalent chemical combinations.
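As an illustration of the general idea, the following is a minimal Apriori-style sketch of frequent itemset mining in Python. It is not the code used in the publication: the function name `frequent_itemsets`, the chemical labels, and the five-person toy data are all hypothetical, and the sketch assumes each person has already been reduced to the set of chemicals counted as present for them.

```python
from itertools import combinations


def frequent_itemsets(transactions, min_support):
    """Return {itemset: support} for itemsets with support >= min_support.

    transactions: list of sets, one set of detected chemicals per person.
    min_support: minimum fraction of people (0 to 1) who must carry the
                 whole combination.
    """
    n = len(transactions)
    # Level 1 candidates: every chemical on its own.
    items = sorted({item for t in transactions for item in t})
    candidates = [frozenset([i]) for i in items]
    result = {}
    k = 1
    while candidates:
        # Keep the candidates that occur in enough people.
        frequent = {}
        for c in candidates:
            support = sum(1 for t in transactions if c <= t) / n
            if support >= min_support:
                frequent[c] = support
        result.update(frequent)
        # Join step: merge frequent k-itemsets into (k+1)-item candidates.
        k += 1
        joined = {a | b for a, b in combinations(list(frequent), 2)
                  if len(a | b) == k}
        # Prune step: every (k-1)-item subset must itself be frequent.
        candidates = [c for c in joined
                      if all(frozenset(s) in frequent
                             for s in combinations(c, k - 1))]
    return result


# Hypothetical toy data: chemicals detected in five people.
people = [
    {"chem_A", "chem_B", "chem_C"},
    {"chem_A", "chem_B"},
    {"chem_A", "chem_C"},
    {"chem_B", "chem_C"},
    {"chem_A", "chem_B", "chem_C"},
]

result = frequent_itemsets(people, min_support=0.5)
for itemset, support in sorted(result.items(),
                               key=lambda kv: (len(kv[0]), sorted(kv[0]))):
    print(sorted(itemset), round(support, 2))
```

In this toy data every single chemical and every pair meets the 50% threshold, while the three-chemical combination occurs in only two of five people and is dropped. A real survey analysis would also have to account for sampling weights and detection limits, which this sketch ignores.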

This dataset is associated with the following publication: Kapraun, D.F., J.F. Wambaugh, R. Tornero-Velez, and R.W. Setzer. Identifying Prevalent Chemical Mixtures in the US Population. Environmental Health Perspectives. National Institute of Environmental Health Sciences (NIEHS), Research Triangle Park, NC, USA, 125(8): 1-16, (2017).

Access & Use Information

Public: This dataset is intended for public access and use. License: See https://pasteur.epa.gov/license/sciencehub-license.html for license information.

References

https://doi.org/10.1289/ehp1265

Dates

Metadata Created Date November 12, 2020
Metadata Updated Date November 12, 2020

Metadata Source

Harvested from EPA ScienceHub

Additional Metadata

Resource Type Dataset
Publisher U.S. EPA Office of Research and Development (ORD)
Maintainer
Identifier https://doi.org/10.23719/1395052
Data Last Modified 2017-08-24
Public Access Level public
Bureau Code 020:00
Schema Version https://project-open-data.cio.gov/v1.1/schema
Harvest Object Id 4e266595-1f66-4359-ac69-706cd2dbd544
Harvest Source Id 04b59eaf-ae53-4066-93db-80f2ed0df446
Harvest Source Title EPA ScienceHub
License https://pasteur.epa.gov/license/sciencehub-license.html
Program Code 020:095
Publisher Hierarchy U.S. Government > U.S. Environmental Protection Agency > U.S. EPA Office of Research and Development (ORD)
Related Documents https://doi.org/10.1289/ehp1265
Source Datajson Identifier True
Source Hash 7076d61af2bdf3f0c09320425ed5eb06121c97df
Source Schema Version 1.1
