Skip to main content
U.S. flag

An official website of the United States government

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

Skip to content

De novo transcriptome assembly and annotations for wheat curl mite (Aceria tosichella)

Metadata Updated: March 30, 2024

To study the impact of wheat streak mosaic virus on global gene expression in wheat curl mite, we generated a de novo transcriptome assembly using 50 x 50 paired end reads from the Illumina HiSeq 2500. Reads were assembled using Trinity (version 2.0.6) and contigs greater than 200 nt were retained. All assembled transcripts were annotated using the Trinotate pipeline using blastp searches against the Swiss-prot/Uni-Prot database, blastx searches against the Swiss-prot/Uni-Prot databases, HMM searches against the Pfam-A database, blastp searches against the non-redundant protein database, and signalP and tmHMM predictions. To reduce noise from low abundance transcripts not well supported by the data, we filtered the assembly to retain only those transcripts with TPM values >=0.5. Resources in this dataset:Resource Title: Raw Trinity Assembly. File Name: Trinity.fasta.txtResource Description: Raw trinity assembly obtained from wheat curl mite using 50 x 50 Illumina paired end reads from the HiSeq2500.Resource Software Recommended: Notepad++,url: https://notepad-plus-plus.org/ Resource Title: Raw Trinity Assembly. File Name: Trinity.fasta.txtResource Description: Raw trinity assembly obtained from wheat curl mite using 50 x 50 Illumina paired end reads from the HiSeq2500.Resource Software Recommended: Text wrangler,url: https://itunes.apple.com/us/app/textwrangler/id404010395?mt=12 Resource Title: Trinotate annotations for raw Trinity assembly. File Name: trinotate_annotations_report.xlsResource Description: Trinotate results for raw wheat curl mite transcriptome assemblyResource Software Recommended: Excel,url: https://products.office.com/en-us/excel Resource Title: Trinotate annotations for raw Trinity assembly. File Name: trinotate_annotations_report.xlsResource Description: Trinotate results for raw wheat curl mite transcriptome assemblyResource Software Recommended: Libre Office Calc,url: https://www.libreoffice.org/discover/calc/ Resource Title: Blastp results versus non-redundant protein database. File Name: wheat_curl_mite_blastp_nr.txtResource Description: Blastp results for protein coding unigenes from raw Trinity transcriptome assembly (wheat curl mite). Output format is default. Resource Software Recommended: Notepad++,url: https://notepad-plus-plus.org/ Resource Title: Blastp results versus non-redundant protein database. File Name: wheat_curl_mite_blastpnr.txtResource Description: Blastp results for protein coding unigenes from raw Trinity transcriptome assembly (wheat curl mite). Output format is default. Resource Software Recommended: Text wrangler,url: https://itunes.apple.com/us/app/textwrangler/id404010395?mt=12 Resource Title: Protein predictions for raw trinity transcriptome assembly (wheat curl mite). File Name: transcriptome.all.cds.pep.fasta.txtResource Description: Putative coding regions were predicted using Transdecoder. Default parameters were used in conjunction with Pfam-A searches to identify putative open reading frames (ORFs).Resource Title: Protein predictions for final transcriptome assembly (wheat curl mite). File Name: transcriptome.all.cds.pep.fasta.txtResource Description: Protein coding regions were predicted using Transdecoder. ORFs were identified using default parameters in conjunction with Pfam-A searches. Resource Software Recommended: Notepad++,url: https://notepad-plus-plus.org/ Resource Title: Protein predictions for final transcriptome assembly (wheat curl mite). File Name: transcriptome.all.cds.pep.fasta.txtResource Description: Protein coding regions were predicted using Transdecoder. ORFs were identified using default parameters in conjunction with Pfam-A searches. Resource Software Recommended: Text wrangler,url: https://itunes.apple.com/us/app/textwrangler/id404010395?mt=12 Resource Title: Final trinity transcriptome assembly for wheat curl mite. File Name: Trinity.mite.fasta.txtResource Description: Transcripts less than 200 nt and transcripts with TPM values less than 0.5 were removed from the assembly. In addition, transcripts whose coding sequences had highest scoring blastp matches to microbes were also removed from the assembly.Resource Title: Nucleotide coding regions for final transcriptome assembly for wheat curl mite. File Name: transcriptome.mite.cds.fasta.txtResource Description: Nucleotide sequences corresponding to coding regions from the final transcriptome assembly for wheat curl mite. Open reading frames (ORFs) were predicted using transdecoder. Default parameters with the addition of the identification of Pfam-A domains was used for ORF identification.Resource Title: Trinotate annotations for final Trinity assembly (wheat curl mite). File Name: trinotate.mite.xlsResource Description: Trinotate results for final wheat curl mite transcritpome assembly. Blastp and blastx searches against Swiss-Prot/Uni-Prot were performed along with Pfam-A searches using HMMER. Signal peptides and transmembrane domains were also identified. Resource Software Recommended: Excel,url: https://products.office.com/en-us/excel Resource Title: Trinotate annotations for final Trinity assembly (wheat curl mite). File Name: trinotate.mite.xlsResource Description: Trinotate results for final wheat curl mite transcritpome assembly. Blastp and blastx searches against Swiss-Prot/Uni-Prot were performed along with Pfam-A searches using HMMER. Signal peptides and transmembrane domains were also identified. Resource Software Recommended: Libre Office Calc,url: https://www.libreoffice.org/discover/calc/

Access & Use Information

Public: This dataset is intended for public access and use. License: us-pd

Downloads & Resources

Dates

Metadata Created Date March 30, 2024
Metadata Updated Date March 30, 2024

Metadata Source

Harvested from USDA JSON

Additional Metadata

Resource Type Dataset
Metadata Created Date March 30, 2024
Metadata Updated Date March 30, 2024
Publisher Agricultural Research Service
Maintainer
Identifier 10.15482/USDA.ADC/1471685
Data Last Modified 2024-02-15
Public Access Level public
Bureau Code 005:18
Metadata Context https://project-open-data.cio.gov/v1.1/schema/catalog.jsonld
Schema Version https://project-open-data.cio.gov/v1.1/schema
Catalog Describedby https://project-open-data.cio.gov/v1.1/schema/catalog.json
Harvest Object Id 9a8fc391-1beb-4bc5-a2da-b0b6848c17f0
Harvest Source Id d3fafa34-0cb9-48f1-ab1d-5b5fdc783806
Harvest Source Title USDA JSON
License https://www.usa.gov/publicdomain/label/1.0/
Program Code 005:040
Source Datajson Identifier True
Source Hash fc2d3700aaece138651d66862ef5e6f9217ac8eb7569e76b6a7fb9d7cd14557c
Source Schema Version 1.1

Didn't find what you're looking for? Suggest a dataset here.