IARPA BETTER (Better Extraction from Text Towards Enhanced Retrieval) information extraction and information retrieval datasets.

Metadata Updated: May 15, 2024

Cross-language information extraction (IE) and information retrieval (IR) datasets developed for the evaluation of the IARPA BETTER program. The documents come from CommonCrawl. The IE annotations, in three schemas, were produced by MITRE and ARLIS. The IR queries and relevance judgments were produced at NIST, which IARPA asked to distribute the data in its final form. All tasks are cross-language, from English into one of Arabic, Farsi, Russian, Chinese, or Korean.

Access & Use Information

Public: This dataset is intended for public access and use. License: see https://www.nist.gov/open/license for license information.

Dates

Metadata Created Date May 15, 2024
Metadata Updated Date May 15, 2024
Data Update Frequency irregular

Metadata Source

Harvested from NIST

Additional Metadata

Resource Type Dataset
Publisher National Institute of Standards and Technology
Maintainer
Identifier ark:/88434/mds2-2946
Data First Published 2024-04-22
Language en
Data Last Modified 2023-02-24 00:00:00
Category Information Technology:Data and informatics
Public Access Level public
Bureau Code 006:55
Metadata Context https://project-open-data.cio.gov/v1.1/schema/data.json
Schema Version https://project-open-data.cio.gov/v1.1/schema
Catalog Describedby https://project-open-data.cio.gov/v1.1/schema/catalog.json
Harvest Object Id 43471760-18f9-4233-a01c-0ca793b80578
Harvest Source Id 74e175d9-66b3-4323-ac98-e2a90eeb93c0
Harvest Source Title NIST
Homepage URL https://ir.nist.gov/better/
License https://www.nist.gov/open/license
Program Code 006:045
Source Datajson Identifier True
Source Hash 18463fe7ad69377c55cfab7bb3b367aa3ae5b533b58732095c2677b353916ad8
Source Schema Version 1.1