
Functional annotation for 15 diverse arthropod genomes

Metadata Updated: March 30, 2024

We present functional annotations for 15 arthropod proteomes. The annotations were produced with an open-source, open-access, containerized pipeline for genome-scale functional annotation of insect proteomes, applied here to a diverse range of arthropod species. More information about the pipeline is available at our readthedocs site. The files for each genome include GOanna, InterProScan and KOBAS predictions.

Arthropod genomes selected for this study:

Apis mellifera (honey bee)
Drosophila melanogaster (fruit fly)
Tribolium castaneum (red flour beetle)
Latrodectus hesperus (Western black widow spider)
Limnephilus lunatus (caddisfly)
Oncopeltus fasciatus (Large milkweed bug)
Homalodisca vitripennis (Glassy-winged sharpshooter)
Eurytemora affinis (calanoid copepod)
Agrilus planipennis (emerald ash borer)
Copidosoma floridanum (parasitoid wasp)
Athalia rosae (turnip sawfly)
Ceratitis capitata (Mediterranean fruit fly)
Cimex lectularius (Cimicidae bed bug)
Varroa destructor (parasitic mite)
Diaphorina citri (Asian citrus psyllid)

Resources in this dataset:

Resource Title: Cimex lectularius (Cimicidae bed bug) annotation.
File Name: CLEC.tar.gz
Resource Description: Functional annotation for Clec-OGSv1.2 protein set

Resource Title: Tribolium castaneum (red flour beetle) annotation.
File Name: TCAS.tar.gz
Resource Description: Functional annotation for TCAS_OGS_v3 protein set

Resource Title: Drosophila melanogaster (fruit fly) annotation.
File Name: DMEL.tar.gz
Resource Description: Functional annotation for DMEL_r6.38 protein set

Resource Title: Varroa destructor (parasitic mite) annotation.
File Name: VDES.tar.gz
Resource Description: Functional annotation for NCBI Varroa destructor Annotation Release 100 protein set based on Vdes_3.0 genome (GCA_002443255.1)

Resource Title: Oncopeltus fasciatus (Large milkweed bug) annotation.
File Name: ONCFAS.tar.gz
Resource Description: Functional annotation for oncfas_OGSv1.2 protein set

Resource Title: Apis mellifera (honey bee) annotation.
File Name: AMEL.tar.gz
Resource Description: Functional annotation for OGSv3.3 protein set from Amel_4.5 genome (GCA_000002195.1)

Resource Title: Homalodisca vitripennis (Glassy-winged sharpshooter) annotation.
File Name: HVIT.tar.gz
Resource Description: Functional annotation for HVIT-BCM_version_0.5.3 protein set based on Hvit_1.0 genome (GCA_000696855.1)

Resource Title: Limnephilus lunatus (caddisfly) annotation.
File Name: LLUN.tar.gz
Resource Description: Functional annotation for LLUN-BCM_version_0.5.3 protein set from Llun_1.0 genome (GCA_000648945.1)

Resource Title: Latrodectus hesperus (Western black widow spider) annotation.
File Name: LHES.tar.gz
Resource Description: Functional annotation for LHES-BCM_version_0.5.3 protein set from Lhes_1.0 genome (GCA_000697925.1)

Resource Title: Eurytemora affinis (calanoid copepod) annotation.
File Name: EAFF.tar.gz
Resource Description: Functional annotation for EAFF-BCM_version_0.5.3 protein set from Eaff_1.0 genome (GCA_000591075.1)

Resource Title: Copidosoma floridanum (parasitoid wasp) annotation.
File Name: CFLO.tar.gz
Resource Description: Functional annotation for CFLO-BCM_version_0.5.3 protein set based on Cflo_1.0 genome (GCA_000648655.1)

Resource Title: Ceratitis capitata (Mediterranean fruit fly) annotation.
File Name: CCAP.tar.gz
Resource Description: Functional annotation for Ccap-OGSv1 protein set based on Ccap_1.1 assembly (GCA_000347755.2)

Resource Title: Athalia rosae (turnip sawfly) annotation.
File Name: AROS.tar.gz
Resource Description: Functional annotation for AROS-BCM_version_0.5.3 protein set based on Aros_1.0 genome (GCA_000344095.1)

Resource Title: Agrilus planipennis (emerald ash borer) annotation.
File Name: APLA.tar.gz
Resource Description: Functional annotation for APLA-BCM_version_0.5.3 protein set based on Apla_1.0 genome (GCA_000699045.1)
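Each resource is distributed as a gzip-compressed tar archive, so a first step after downloading is usually to see what an archive contains. The following is a minimal Python sketch of that step using only the standard library. The helper name list_archive_members is hypothetical, and the internal layout of the archives is not described in this listing, so the sketch only lists member names and makes no assumption about which files hold the GOanna, InterProScan or KOBAS predictions.

```python
import tarfile


def list_archive_members(archive_path):
    """Return the member names stored in a gzip-compressed tar archive.

    archive_path is the local path of a downloaded resource file,
    for example "CLEC.tar.gz". The archive is opened read-only and
    nothing is extracted to disk.
    """
    with tarfile.open(archive_path, mode="r:gz") as archive:
        return archive.getnames()
```

With a downloaded copy of CLEC.tar.gz in the working directory, calling list_archive_members("CLEC.tar.gz") returns the file and directory names inside it. Listing before extracting is a cautious default, since it shows where the files would land without writing anything.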

Access & Use Information

Public: This dataset is intended for public access and use.
License: Creative Commons CCZero

Dates

Metadata Created Date March 30, 2024
Metadata Updated Date March 30, 2024

Metadata Source

Harvested from USDA JSON

Additional Metadata

Resource Type Dataset
Publisher Agricultural Research Service
Maintainer
Identifier 10.15482/USDA.ADC/1522860
Data Last Modified 2023-11-30
Public Access Level public
Bureau Code 005:18
Metadata Context https://project-open-data.cio.gov/v1.1/schema/catalog.jsonld
Schema Version https://project-open-data.cio.gov/v1.1/schema
Catalog Describedby https://project-open-data.cio.gov/v1.1/schema/catalog.json
Harvest Object Id 30a9640b-8f59-43d4-8d44-bb44c052b1b0
Harvest Source Id d3fafa34-0cb9-48f1-ab1d-5b5fdc783806
Harvest Source Title USDA JSON
License https://creativecommons.org/publicdomain/zero/1.0/
Old Spatial {"type": "Polygon", "coordinates": [[[-125.33203125, 30.654452824401], [-125.33203125, 48.848450835898], [-74.35546875, 48.848450835898], [-74.35546875, 30.654452824401], [-125.33203125, 30.654452824401]]]}
Program Code 005:040
Source Datajson Identifier True
Source Hash 5404d7fc1d0bc6f095ac40ab6358aa50eba78d410ef01c9ac88cad06d228b1ae
Source Schema Version 1.1
Spatial {"type": "Polygon", "coordinates": [[[-125.33203125, 30.654452824401], [-125.33203125, 48.848450835898], [-74.35546875, 48.848450835898], [-74.35546875, 30.654452824401], [-125.33203125, 30.654452824401]]]}
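
The Spatial field describes a closed rectangular ring of coordinates. As a rough illustration of how that extent can be read, the Python sketch below takes the coordinate values as a flat sequence, pairs them up, and derives a bounding box and a GeoJSON-style polygon. It assumes the values alternate longitude, latitude (the GeoJSON axis order); the function names are hypothetical and are not part of any catalog API.

```python
# Coordinate values from the Spatial field, written as a flat sequence.
# Assumption: values alternate longitude, latitude (GeoJSON axis order).
FLAT_COORDS = [
    -125.33203125, 30.654452824401,
    -125.33203125, 48.848450835898,
    -74.35546875, 48.848450835898,
    -74.35546875, 30.654452824401,
    -125.33203125, 30.654452824401,
]


def to_ring(flat):
    """Pair a flat [lon, lat, lon, lat, ...] sequence into (lon, lat) tuples."""
    if len(flat) % 2 != 0:
        raise ValueError("expected an even number of coordinate values")
    return list(zip(flat[0::2], flat[1::2]))


def bounding_box(ring):
    """Return (min_lon, min_lat, max_lon, max_lat) for a ring of (lon, lat)."""
    lons = [lon for lon, _ in ring]
    lats = [lat for _, lat in ring]
    return (min(lons), min(lats), max(lons), max(lats))


def to_geojson_polygon(ring):
    """Wrap one ring as a GeoJSON-style Polygon dictionary."""
    return {"type": "Polygon", "coordinates": [[list(point) for point in ring]]}


RING = to_ring(FLAT_COORDS)
BBOX = bounding_box(RING)
POLYGON = to_geojson_polygon(RING)
```

Under the stated axis-order assumption, the box spans longitudes -125.33203125 to -74.35546875 and latitudes 30.654452824401 to 48.848450835898.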
