Skip to main content
U.S. flag

An official website of the United States government

TV_VTT (TrecVid Video-To-Text) Dataset

Published by National Institute of Standards and Technology | National Institute of Standards and Technology | Catalog Last Checked: August 02, 2025 at 02:41 PM | Dataset Last Updated: January 06, 2025
This dataset contains short videos (ranging from 3 seconds to 10 seconds) from TRECVID VTT task from 2016 to 2024. There are 73,893 videos with captions. Each video has between 2 and 5 captions, which have been written by dedicated annotators hired by NIST.

Resources

4 resources available

  • DOI Access for TV_VTT (TrecVid Video-To-Text) Dataset

    FILE
  • VIDEOS ARE IN MP4 AND CAPTIONS ARE IN PLAIN TEXT.

    TV_VTT

    VIDEOS ARE IN MP4 AND CAPTIONS ARE IN PLAIN TEXT.
  • Readme

    TXT
  • TEXT FILE

    data agreement form

    TEXT FILE

Find Related Datasets

Search by Tags

Click any tag below to search for similar datasets

data.gov

An official website of the GSA's Technology Transformation Services

Looking for U.S. government information and services?
Visit USA.gov