Skip to main content
U.S. flag

An official website of the United States government

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

Skip to content

Trojan Detection Software Challenge - nlp-summary-jan2022-holdout

Metadata Updated: September 30, 2023

Round 9 Holdout DatasetThis is the holdout data used to evaluate trojan detection software solutions. This data, generated at NIST, consists of natural language processing (NLP) AIs trained to perform one of three tasks, sentiment classification, named entity recognition, or extractive question answering on English text. A known percentage of these trained AI models have been poisoned with a known trigger which induces incorrect behavior. This data will be used to develop software solutions for detecting which trained AI models have been poisoned via embedded triggers. This dataset consists of 410 Sentiment Classification, Named Entity Recognition, and Extractive Question Answering AI models using a small set of model architectures. Half (50%) of the models have been poisoned with an embedded trigger which causes misclassification of the input when the trigger is present.

Access & Use Information

Public: This dataset is intended for public access and use. License: See this page for license information.

Downloads & Resources


Metadata Created Date May 9, 2023
Metadata Updated Date September 30, 2023
Data Update Frequency irregular

Metadata Source

Harvested from NIST

Additional Metadata

Resource Type Dataset
Metadata Created Date May 9, 2023
Metadata Updated Date September 30, 2023
Publisher National Institute of Standards and Technology
Identifier ark:/88434/mds2-2782
Data First Published 2023-04-07
Language en
Data Last Modified 2022-01-31 00:00:00
Category Information Technology:Software research, Information Technology:Cybersecurity
Public Access Level public
Data Update Frequency irregular
Bureau Code 006:55
Metadata Context
Schema Version
Catalog Describedby
Harvest Object Id 8aeab931-ef31-4736-b37f-ea2303c14229
Harvest Source Id 74e175d9-66b3-4323-ac98-e2a90eeb93c0
Harvest Source Title NIST
Homepage URL
Program Code 006:045
Source Datajson Identifier True
Source Hash ba9546b664d2d3b35e644741cbdbe0f6f478348f0d63ef57e894d526ee7b27e4
Source Schema Version 1.1

Didn't find what you're looking for? Suggest a dataset here.