Halyomorpha halys Official Gene Set v1.2

Metadata Updated: November 10, 2020

This dataset presents the Halyomorpha halys Official Gene Set (OGS) v1.2. OGSv1.2 updates Halyomorpha halys OGSv1.1 (https://doi.org/10.15482/USDA.ADC/1504240) to the coordinates of genome assembly GCA_000696795.3 (https://www.ncbi.nlm.nih.gov/assembly/GCA_000696795.3). The coordinate conversion was performed with https://github.com/NAL-i5K/coordinates_conversion/.

The original OGSv1.0 integrates automatic gene predictions from NCBI's eukaryotic annotation pipeline, NCBI Halyomorpha halys Annotation Release 100 (https://www.ncbi.nlm.nih.gov/genome/annotation_euk/Halyomorpha_halys/100/; ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/000/696/795/GCF_000696795.1_H...), with manual annotations contributed by the research community using the Apollo manual curation software (http://genomearchitect.org/). The community's manual annotations were downloaded from Apollo, quality-checked, and merged with NCBI Halyomorpha halys Annotation Release 100 using the GFF3toolkit software (https://github.com/NAL-i5K/GFF3toolkit/releases/tag/v1.4.4). The merged dataset was then formatted for ingest into the i5k Workspace and GenBank databases, yielding Halyomorpha halys Official Gene Set (OGS) v1.0.

Halyomorpha halys Official Gene Set halhal_OGSv1.1 is a minor update of halhal_OGSv1.0: Alias attributes were added to all manually annotated cathepsin models, six models from contaminated scaffolds were removed, and notes were added to three models located on possibly contaminated scaffolds.
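Because the description mentions Alias attributes on gene models, a reader working with the gene set in GFF3 form may want to list the features that carry one. The following is a minimal sketch, not the method used to build the OGS: it relies only on the general GFF3 layout (nine tab-separated columns, with `tag=value` pairs in column 9 separated by semicolons and multiple values separated by commas). The scaffold names, feature IDs, and alias values in `SAMPLE` are invented for illustration and do not come from the dataset.

```python
from urllib.parse import unquote


def parse_attributes(column9):
    """Parse GFF3 column 9 into a dict mapping each tag to a list of values."""
    attributes = {}
    for field in column9.strip().split(";"):
        if not field:
            continue
        tag, _, value = field.partition("=")
        # Split on literal commas first, then decode percent-escapes,
        # so that an escaped comma (%2C) stays inside a single value.
        attributes[unquote(tag)] = [unquote(v) for v in value.split(",")]
    return attributes


def features_with_alias(gff3_lines):
    """Yield (ID, aliases) for every feature line that has an Alias attribute."""
    for line in gff3_lines:
        if line.startswith("#") or not line.strip():
            continue  # skip directives, comments, and blank lines
        columns = line.rstrip("\n").split("\t")
        if len(columns) != 9:
            continue  # not a feature line (for example, embedded sequence)
        attributes = parse_attributes(columns[8])
        if "Alias" in attributes:
            yield attributes.get("ID", [None])[0], attributes["Alias"]


# Hypothetical input: identifiers and aliases are made up for this example.
SAMPLE = [
    "##gff-version 3",
    "Scaffold1\texample\tgene\t100\t900\t.\t+\t.\tID=gene1;Name=example;Alias=CatL-1,CatL-like",
    "Scaffold1\texample\tgene\t1500\t2400\t.\t-\t.\tID=gene2;Name=other",
]

result = list(features_with_alias(SAMPLE))
print(result)
```

To scan a real file, pass an open file handle instead of `SAMPLE`; the generator reads it line by line, so the whole annotation never has to be held in memory.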

Access & Use Information

Public: This dataset is intended for public access and use.
License: Creative Commons CCZero

Downloads & Resources


Metadata Created Date November 10, 2020
Metadata Updated Date November 10, 2020

Metadata Source

Harvested from USDA JSON

Additional Metadata

Resource Type Dataset
Publisher Agricultural Research Service
Unique Identifier Unknown
Identifier 9b973203-90bd-463c-84b5-5f855f689846
Data Last Modified 2020-06-03
Public Access Level public
Bureau Code 005:18
Metadata Context https://project-open-data.cio.gov/v1.1/schema/catalog.jsonld
Schema Version https://project-open-data.cio.gov/v1.1/schema
Catalog Describedby https://project-open-data.cio.gov/v1.1/schema/catalog.json
Harvest Object Id f1b460e0-de9c-49d9-a01c-8dd0b3ef17d1
Harvest Source Id d3fafa34-0cb9-48f1-ab1d-5b5fdc783806
Harvest Source Title USDA JSON
License https://creativecommons.org/publicdomain/zero/1.0/
Program Code 005:040
Source Datajson Identifier True
Source Hash 55f6d95f08d486fcc2ef831f0f11a80576c3b737
Source Schema Version 1.1
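
The fields above were harvested from a catalog that follows the Project Open Data v1.1 schema, in which datasets are entries in a top-level `dataset` array. As a rough sketch of how such a record could be looked up, the example below embeds a small catalog and searches it by identifier. The embedded JSON is reconstructed from the values listed on this page and is an assumption about how the source entry looks, in particular that the listed Identifier is the entry's `identifier` field. It is not a copy of the actual USDA file, and `find_dataset` is a hypothetical helper.

```python
import json

# Assumed shape of one catalog entry, rebuilt from the metadata listed above.
CATALOG_JSON = """
{
  "conformsTo": "https://project-open-data.cio.gov/v1.1/schema",
  "dataset": [
    {
      "title": "Halyomorpha halys Official Gene Set v1.2",
      "identifier": "9b973203-90bd-463c-84b5-5f855f689846",
      "modified": "2020-06-03",
      "accessLevel": "public",
      "bureauCode": ["005:18"],
      "programCode": ["005:040"],
      "license": "https://creativecommons.org/publicdomain/zero/1.0/",
      "publisher": {"name": "Agricultural Research Service"}
    }
  ]
}
"""


def find_dataset(catalog, identifier):
    """Return the first dataset entry with the given identifier, or None."""
    for entry in catalog.get("dataset", []):
        if entry.get("identifier") == identifier:
            return entry
    return None


catalog = json.loads(CATALOG_JSON)
record = find_dataset(catalog, "9b973203-90bd-463c-84b5-5f855f689846")
print(record["title"], record["modified"], record["publisher"]["name"])
```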
