Skip to main content
U.S. flag

An official website of the United States government

NIST test dataset for assessing baseline nucleic acid sequence screening

Published by National Institute of Standards and Technology | National Institute of Standards and Technology | Catalog Last Checked: August 02, 2025 at 03:57 PM | Dataset Last Updated: August 09, 2024
This repository contains the dataset used in the manuscript "Inter-tool analysis of a NIST dataset for assessing baseline nucleic acid sequence screening". NIST constructed the test dataset based on the current screening recommendations from HHS. The dataset is a FASTA formatted file with blinded numerical sequence headers. The dataset was sent to sequence screening tool developers for initial testing and to obtain feedback about its utility for assessing baseline sequence screening. An additional metadata file provides the NIST-assigned label for each sequence, along with a more detailed description derived from the source database.

Resources

3 resources available

  • NIST_nucleic_acid_synthesis_screening_test_dataset

    FASTA
  • README

    TEXT/MARKDOWN
  • NIST_nucleic_acid_syntheisis_screening_test_dataset_metadata

    TEXT/TAB-SEPARATED-VALUES

Find Related Datasets

Search by Tags

Click any tag below to search for similar datasets

data.gov

An official website of the GSA's Technology Transformation Services

Looking for U.S. government information and services?
Visit USA.gov