Skip to main content
U.S. flag

An official website of the United States government

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

Skip to content

Protein Clusters

Metadata Updated: June 19, 2025

A collection of Reference Sequence (RefSeq) proteins, from the complete genomes of prokaryotes, plasmids, and organelles, that have been grouped and annotated based on sequence similarity and protein function.

Access & Use Information

Public: This dataset is intended for public access and use. License: See this page for license information.

Downloads & Resources

Dates

Metadata Created Date March 2, 2022
Metadata Updated Date June 19, 2025

Metadata Source

Harvested from Healthdata.gov

Additional Metadata

Resource Type Dataset
Metadata Created Date March 2, 2022
Metadata Updated Date June 19, 2025
Publisher National Library of Medicine
Maintainer
Identifier https://datadiscovery.nlm.nih.gov/api/views/qmpj-ixje
Data First Published 2022-03-01
Data Last Modified 2025-06-18
Category Biology
Public Access Level public
Bureau Code 009:25
Metadata Context https://project-open-data.cio.gov/v1.1/schema/catalog.jsonld
Metadata Catalog ID https://healthdata.gov/data.json
Schema Version https://project-open-data.cio.gov/v1.1/schema
Catalog Describedby https://project-open-data.cio.gov/v1.1/schema/catalog.json
Harvest Object Id 6293e46e-7ee7-4e12-8505-e9cc89f77d8c
Harvest Source Id 651e43b2-321c-4e4c-b86a-835cfc342cb0
Harvest Source Title Healthdata.gov
Homepage URL https://www.ncbi.nlm.nih.gov/proteinclusters
License http://opendefinition.org/licenses/odc-odbl/
Program Code 009:041
Source Datajson Identifier True
Source Hash 47d27a682b0ddd445333ee750e182a5915603f8f69daacbbe8b2028827e9ae5c
Source Schema Version 1.1

Didn't find what you're looking for? Suggest a dataset here.