Skip to main content
U.S. flag

An official website of the United States government

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

Skip to content

Data from: Assessing metabolomic and chemical diversity of a soybean lineage representing 35 years of breeding

Metadata Updated: June 5, 2025

Information on crop genotype- and phenotype-metabolite associations can be of value to trait development as well as to food security and safety. The unique study presented here assessed seed metabolomic and ionomic diversity in a soybean (Glycine max) lineage representing ~35 years of breeding (launch years 1972–2008) and increasing yield potential. Selected varieties included six conventional and three genetically modified (GM) glyphosate-tolerant lines. A metabolomics approach utilizing capillary electrophoresis (CE)-time-of-flight-mass spectrometry (TOF-MS), gas chromatography (GC)-TOF-MS and liquid chromatography (LC)-quadrupole (q)-TOFMS resulted in measurement of a total of 732 annotated peaks. Ionomics through inductively-coupled plasma (ICP)-MS profiled twenty mineral elements. Orthogonal partial least squares-discriminant analysis (OPLS-DA) of the seed data successfully differentiated newer higher-yielding soybean from earlier lower-yielding accessions at both field sites. This result reflected genetic fingerprinting data that demonstrated a similar distinction between the newer and older soybean. Correlation analysis also revealed associations between yield data and specific metabolites. There were no clear metabolic differences between the conventional and GM lines. Overall, observations of metabolic and genetic differences between older and newer soybean varieties provided novel and significant information on the impact of varietal development on biochemical variability. Proposed applications of omics in food and feed safety assessments will need to consider that GM is not a major source of metabolite variability and that trait development in crops will, of necessity, be associated with biochemical variation. Resources in this dataset:Resource Title: Pointer to Electronic Supplementary Material. File Name: Web Page, url: https://link.springer.com/article/10.1007/s11306-014-0702-6#Sec17 Link to Electronic Supplementary Material at Metabolomics. Files are: Supplementary material 1: Full metabolite profile and ionomic dataset - Download Excel Supplementary material 2: List of annotated metabolites - Download Excel

List of the 681 annotated metabolites obtained by using 4 platforms after data summarization and removal of 50% missing value in the metabolite profile data. Some metabolite peaks could not be summarized and thus kept in the list.

Supplementary material 3: Monsanto_ionomics_Data_Baxterla - Download Excel Supplementary material 4: Metabolomics metadata - Download .docx

Plant context metadata; Chemical analysis metadata.

Supplementary material 5: Spearman correlations between yield and metabolites/ions - Download Excel Supplementary material 6: Supporting Tables and Figures - Download .docx

Supporting Table 1.Similarity matrix: Genetic similarity of different soybean varieties based on genetic fingerprint data. Supporting Table 2. Metabolite Coverage of Analytical Platforms. Supporting Table 3. Summary of Statistically Significant Differences in Ionomic Profiles. Supporting Figure A. PCA (principal components one and two) based on the genotypic data of 1,484 pre-commercial and commercial proprietary Monsanto lines. Supporting Fig. 1. Evaluation of the achieved coverage of metabolite profile data. Supporting Fig. 2. Principal component analysis of the identified or annotated metabolites/ peaks. Supporting Fig. 3. Principal component analysis of the identified or annotated metabolites/peaks and including the ionomics data. Supporting Fig. 4. The score scatter plot of OPLS-DA using the identified or annotated metabolites/ peaks and including the ionomics data. Supporting Fig. 5. Graphic representation of nodes of the first neighbors in the yield-to-metabolite correlation networks of samples harvested at ILJA and ILJE.

Access & Use Information

Public: This dataset is intended for public access and use. License: Creative Commons Attribution

Downloads & Resources

Dates

Metadata Created Date March 30, 2024
Metadata Updated Date June 5, 2025

Metadata Source

Harvested from USDA JSON

Additional Metadata

Resource Type Dataset
Metadata Created Date March 30, 2024
Metadata Updated Date June 5, 2025
Publisher Agricultural Research Service
Maintainer
Identifier 10.1007/s11306-014-0702-6
Data Last Modified 2025-05-02
Public Access Level public
Bureau Code 005:18
Metadata Context https://project-open-data.cio.gov/v1.1/schema/catalog.jsonld
Schema Version https://project-open-data.cio.gov/v1.1/schema
Catalog Describedby https://project-open-data.cio.gov/v1.1/schema/catalog.json
Harvest Object Id a913c5d8-efd1-4d64-aaa2-2e73f87438c3
Harvest Source Id d3fafa34-0cb9-48f1-ab1d-5b5fdc783806
Harvest Source Title USDA JSON
License https://creativecommons.org/licenses/by/4.0/
Old Spatial {"type": "Polygon", "coordinates": -90.639984, 42.510065, -88.788778, 42.493634, -87.802929, 42.493634, -87.83579, 42.301941, -87.682436, 42.077386, -87.523605, 41.710431, -87.529082, 39.34987, -87.63862, 39.169131, -87.512651, 38.95553, -87.49622, 38.780268, -87.62219, 38.637868, -87.655051, 38.506421, -87.83579, 38.292821, -87.950806, 38.27639, -87.923421, 38.15042, -88.000098, 38.101128, -88.060345, 37.865619, -88.027483, 37.799896, -88.15893, 37.657496, -88.065822, 37.482234, -88.476592, 37.389126, -88.514931, 37.285064, -88.421823, 37.153617, -88.547792, 37.071463, -88.914747, 37.224817, -89.029763, 37.213863, -89.183118, 37.038601, -89.133825, 36.983832, -89.292656, 36.994786, -89.517211, 37.279587, -89.435057, 37.34531, -89.517211, 37.537003, -89.517211, 37.690357, -89.84035, 37.903958, -89.949889, 37.88205, -90.059428, 38.013497, -90.355183, 38.216144, -90.349706, 38.374975, -90.179921, 38.632391, -90.207305, 38.725499, -90.10872, 38.845992, -90.251121, 38.917192, -90.470199, 38.961007, -90.585214, 38.867899, -90.661891, 38.928146, -90.727615, 39.256762, -91.061708, 39.470363, -91.368417, 39.727779, -91.494386, 40.034488, -91.50534, 40.237135, -91.417709, 40.379535, -91.401278, 40.560274, -91.121954, 40.669813, -91.09457, 40.823167, -90.963123, 40.921752, -90.946692, 41.097014, -91.111001, 41.239415, -91.045277, 41.414677, -90.656414, 41.463969, -90.344229, 41.589939, -90.311367, 41.743293, -90.179921, 41.809016, -90.141582, 42.000709, -90.168967, 42.126679, -90.393521, 42.225264, -90.420906, 42.329326, -90.639984, 42.510065}
Program Code 005:040
Source Datajson Identifier True
Source Hash 3b8cfa41034d811b041d5a861720c5b6e58f4c9703c607c1ee1eb726bea62b76
Source Schema Version 1.1
Spatial {"type": "Polygon", "coordinates": -90.639984, 42.510065, -88.788778, 42.493634, -87.802929, 42.493634, -87.83579, 42.301941, -87.682436, 42.077386, -87.523605, 41.710431, -87.529082, 39.34987, -87.63862, 39.169131, -87.512651, 38.95553, -87.49622, 38.780268, -87.62219, 38.637868, -87.655051, 38.506421, -87.83579, 38.292821, -87.950806, 38.27639, -87.923421, 38.15042, -88.000098, 38.101128, -88.060345, 37.865619, -88.027483, 37.799896, -88.15893, 37.657496, -88.065822, 37.482234, -88.476592, 37.389126, -88.514931, 37.285064, -88.421823, 37.153617, -88.547792, 37.071463, -88.914747, 37.224817, -89.029763, 37.213863, -89.183118, 37.038601, -89.133825, 36.983832, -89.292656, 36.994786, -89.517211, 37.279587, -89.435057, 37.34531, -89.517211, 37.537003, -89.517211, 37.690357, -89.84035, 37.903958, -89.949889, 37.88205, -90.059428, 38.013497, -90.355183, 38.216144, -90.349706, 38.374975, -90.179921, 38.632391, -90.207305, 38.725499, -90.10872, 38.845992, -90.251121, 38.917192, -90.470199, 38.961007, -90.585214, 38.867899, -90.661891, 38.928146, -90.727615, 39.256762, -91.061708, 39.470363, -91.368417, 39.727779, -91.494386, 40.034488, -91.50534, 40.237135, -91.417709, 40.379535, -91.401278, 40.560274, -91.121954, 40.669813, -91.09457, 40.823167, -90.963123, 40.921752, -90.946692, 41.097014, -91.111001, 41.239415, -91.045277, 41.414677, -90.656414, 41.463969, -90.344229, 41.589939, -90.311367, 41.743293, -90.179921, 41.809016, -90.141582, 42.000709, -90.168967, 42.126679, -90.393521, 42.225264, -90.420906, 42.329326, -90.639984, 42.510065}
Temporal 1972-01-01/2008-12-31

Didn't find what you're looking for? Suggest a dataset here.