DNA sequences used to analyze evolutionary rates of transient receptor potential (Trp) genes in tetrapods

Metadata Updated: September 17, 2025

The dataset consists of two file types. The first consists of sets of orthologous gene sequences from the transient receptor potential (Trp) gene superfamily in FASTA format. The sequence data were obtained from the Gene database of the National Center for Biotechnology Information (NCBI, www.ncbi.nlm.nih.gov/gene), aligned at the codon level, and trimmed to a defined region present in each gene family member, termed the Trp box and C-terminal region. The sequences are grouped into files by gene and by taxonomic group.

The second file type consists of phylogenetic tree files in Newick format whose taxa correspond to those in the paired sequence file. Phylogenetic tree files in this format are often required for estimating the rate of evolution of a gene over an evolutionary period. The phylogenetic trees do not include or require branch lengths and were not generated from the Trp gene sequences themselves but from mitochondrial genome sequences or the literature as described herein. While the choice of genes, taxa, and tree labels was dictated by specific hypotheses for interpretive analysis, these files can be used for other purposes and modified accordingly.

In addition to these two data components, a descriptive file (list.of.accessions.txt) links the taxon names in each sequence file to a permanent accession in the NCBI database from which the analyzed sequence was derived. The original accession identifiers were not used in the data files, both for readability and to maximize compatibility with software packages that may not interpret special characters equivalently. Note that the particular species for which the sequences were obtained are not germane to the analysis objectives, as long as they are representative of their taxonomic group and the sequences have low levels of missing data.
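Because each sequence file is paired with a tree file over the same taxa, it can be useful to confirm that pairing before running a rate analysis. The following Python sketch is illustrative only and is not part of the dataset. The function names and the toy taxa are hypothetical. It assumes that FASTA headers begin with the taxon name, that Newick leaf labels are unquoted and carry no extra annotations, and that a codon-level alignment has equal-length sequences whose length is a multiple of three.

```python
import re


def read_fasta(text):
    """Parse FASTA text into {name: sequence}, using the first header token as the name."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            parts = line[1:].split()
            name = parts[0] if parts else ""
            records[name] = []
        elif name is not None:
            records[name].append(line)
    return {key: "".join(chunks) for key, chunks in records.items()}


def newick_leaves(newick):
    """Return the set of leaf labels in a Newick string (assumes unquoted labels)."""
    # A leaf label follows "(" or "," and runs until ",", ")", ":" or ";".
    # Internal node labels follow ")" and are therefore not captured.
    labels = re.findall(r"[(,]\s*([^(),:;]+)", newick)
    return {label.strip() for label in labels}


def check_pair(fasta_text, newick_text):
    """Compare a FASTA alignment with its paired tree and report basic consistency."""
    seqs = read_fasta(fasta_text)
    leaves = newick_leaves(newick_text)
    lengths = {len(seq) for seq in seqs.values()}
    return {
        "same_length": len(lengths) == 1,
        "codon_multiple": all(n % 3 == 0 for n in lengths),
        "missing_from_tree": sorted(set(seqs) - leaves),
        "missing_from_fasta": sorted(leaves - set(seqs)),
    }


# Toy data for illustration only; these are not sequences or taxa from the dataset.
toy_fasta = ">taxonA\nATGGCC---\n>taxonB\nATGGCCAAA\n>taxonC\nATG---AAA\n"
toy_tree = "((taxonA,taxonB),taxonC);"
print(check_pair(toy_fasta, toy_tree))
```

To apply the same check to real files, read each FASTA file and its paired Newick file into strings and pass them to `check_pair`. Trees whose labels carry software-specific annotations would need those stripped first.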

Access & Use Information

Public: This dataset is intended for public access and use.
License: No license information was provided. If this work was prepared by an officer or employee of the United States government as part of that person's official duties, it is considered a U.S. Government Work.

Dates

Metadata Created Date September 12, 2025
Metadata Updated Date September 17, 2025

Metadata Source

Harvested from DOI USGS DCAT-US

Additional Metadata

Resource Type Dataset
Publisher U.S. Geological Survey
Identifier http://datainventory.doi.gov/id/dataset/usgs-681a7715d4be0257c3c31bb2
Data Last Modified 2025-08-01T00:00:00Z
Category geospatial
Public Access Level public
Bureau Code 010:12
Metadata Context https://project-open-data.cio.gov/v1.1/schema/catalog.jsonld
Metadata Catalog ID https://ddi.doi.gov/usgs-data.json
Schema Version https://project-open-data.cio.gov/v1.1/schema
Catalog Describedby https://project-open-data.cio.gov/v1.1/schema/catalog.json
Harvest Object Id da3e72f7-ec07-4fc6-b046-00a965fbd810
Harvest Source Id 2b80d118-ab3a-48ba-bd93-996bbacefac2
Harvest Source Title DOI USGS DCAT-US
Metadata Type geospatial
Old Spatial -180.0000, -90.0000, 180.0000, 90.0000
Source Datajson Identifier True
Source Hash d0b52940175b24694dfe5363784cb0c7bbb6f1189855cc8d6034066b53d25f46
Source Schema Version 1.1
Spatial {"type": "Polygon", "coordinates": [[[-180.0000, -90.0000], [-180.0000, 90.0000], [180.0000, 90.0000], [180.0000, -90.0000], [-180.0000, -90.0000]]]}
