Supplementary data for "Distributions of fitness effects for amino acid changes from high-throughput mutagenesis experiments" (McCandlish and Stoltzfus, 2018)

Metadata Updated: September 30, 2025

McCandlish and Stoltzfus gathered data from deep mutational scanning experiments on 12 proteins, comprising 56,641 distinct amino acid replacement mutations. By converting fitnesses to within-study quantiles, they combined results from all studies to draw general conclusions about the distributions of fitness effects (DFEs) for the 380 possible types of amino acid change in proteins (20 source amino acids, each with 19 alternatives). They found that most replacements are neither conservative nor radical: their distributions differ only slightly from the background distribution. The shapes of these distributions can be approximated by a maximum-entropy model with only one parameter. This data package makes it possible to reproduce the main calculations of McCandlish and Stoltzfus. The data may also be useful to researchers carrying out meta-analyses of mutation-scanning or DFE experiments.
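The within-study quantile conversion mentioned in the description can be illustrated with a minimal Python sketch. This is not the authors' code: the record layout, the function name `within_study_quantiles`, the study and mutation labels, and the tie-handling rule (average rank, mapped to the open interval (0, 1) as (rank - 0.5) / n) are all assumptions chosen for illustration. The README in the data package is the authoritative description of the actual file formats and calculations.

```python
from collections import defaultdict

# The 20 standard amino acids; ordered pairs of distinct residues give
# the 20 * 19 = 380 types of amino acid change mentioned above.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
CHANGE_TYPES = [(a, b) for a in AMINO_ACIDS for b in AMINO_ACIDS if a != b]


def within_study_quantiles(records):
    """Convert raw fitness values to quantiles within each study.

    `records` is an iterable of (study, mutation, fitness) tuples
    (a hypothetical layout). Returns a dict that maps
    (study, mutation) to a quantile in (0, 1). Tied fitness values
    share the average of their ranks (an assumed convention).
    """
    by_study = defaultdict(list)
    for study, mutation, fitness in records:
        by_study[study].append((mutation, fitness))

    quantiles = {}
    for study, rows in by_study.items():
        n = len(rows)
        ordered = sorted(rows, key=lambda row: row[1])
        i = 0
        while i < n:
            # Extend j to cover the run of values tied with ordered[i].
            j = i
            while j + 1 < n and ordered[j + 1][1] == ordered[i][1]:
                j += 1
            avg_rank = ((i + 1) + (j + 1)) / 2  # 1-based average rank
            q = (avg_rank - 0.5) / n
            for k in range(i, j + 1):
                quantiles[(study, ordered[k][0])] = q
            i = j + 1
    return quantiles


# Made-up example values on two different fitness scales.
records = [
    ("study_A", "A10V", 0.2),
    ("study_A", "A10G", 0.9),
    ("study_A", "L11P", 0.2),
    ("study_A", "K12R", 1.1),
    ("study_B", "A5V", -3.0),
    ("study_B", "D6E", 0.0),
]
q = within_study_quantiles(records)
```

Because each study is ranked separately, values measured on incompatible fitness scales end up on a common 0 to 1 scale, which is what allows results from different experiments to be pooled by type of amino acid change.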

Access & Use Information

Public: This dataset is intended for public access and use. License: https://www.nist.gov/open/license

Dates

Metadata Created Date November 12, 2020
Metadata Updated Date September 30, 2025

Metadata Source

Harvested from Commerce Non Spatial Data.json Harvest Source

Additional Metadata

Resource Type Dataset
Publisher National Institute of Standards and Technology
Maintainer
Identifier 762AD4DA63D05927E05324570681D36C1970
Language en
Data Last Modified 2018-09-18 00:00:00
Category Mathematics and Statistics:Numerical methods and software, Chemistry:Molecular characterization, Bioscience:Engineering/synthetic biology
Public Access Level public
Bureau Code 006:55
Metadata Context https://project-open-data.cio.gov/v1.1/schema/catalog.jsonld
Schema Version https://project-open-data.cio.gov/v1.1/schema
Catalog Describedby https://project-open-data.cio.gov/v1.1/schema/catalog.json
Data Dictionary https://data.nist.gov/od/ds/762AD4DA63D05927E05324570681D36C1970/README.md
Harvest Object Id 0da9c9c0-db7f-492f-adaf-009f722d61c5
Harvest Source Id bce99b55-29c1-47be-b214-b8e71e9180b1
Harvest Source Title Commerce Non Spatial Data.json Harvest Source
Homepage URL https://data.nist.gov/od/id/762AD4DA63D05927E05324570681D36C1970
License https://www.nist.gov/open/license
Program Code 006:045
Source Datajson Identifier True
Source Hash 3dfcd0fd106795cc4b55f4b4b66bae476292bde812d9445bab59e4e6cccd4a60
Source Schema Version 1.1