Skip to main content
U.S. flag

An official website of the United States government

Determining the Predictive Limit of QSAR Models

Published by U.S. EPA Office of Research and Development (ORD) | U.S. Environmental Protection Agency | Catalog Last Checked: April 21, 2026 at 08:01 PM | Dataset Last Updated: June 21, 2021
The research done to evaluate how the predictivity of models are effected by error in either the training or the test set is simple to describe conceptually. Benchmark datasets are downloaded from reputable sources. Then the datasets are split into training and test sets. Randomized error is added and then models created on both error laden and native training sets. Those models are used to predict both error laden and native test sets. Differences in standard statistics commonly used to assess predictivity are observed. This dataset is associated with the following publication: Kolmar, S., and C. Grulke. The Effect of Noise on the Predictive Limit of QSAR Models. Journal of Cheminformatics. Springer, New York, NY, USA, 13: 92, (2021).

Resources

1 resource available

  • https://github.com/USEPA/CompTox-ChemInf-ModelExperiments-ErrorEffects/tree/SIRepo

    FILE

Find Related Datasets

Search by Tags

Click any tag below to search for similar datasets

data.gov

An official website of the GSA's Technology Transformation Services

Looking for U.S. government information and services?
Visit USA.gov