Skip to main content
U.S. flag

An official website of the United States government

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

Skip to content

UPIC

Metadata Updated: March 30, 2024

We introduce here the concept of Unique Pattern Informative Combinations (UPIC), a decision tool for the cost-effective design of DNA fingerprinting/genotyping experiments using simple-sequence/tandem repeat (SSR/STR) markers. After the first screening of SSR-markers tested on a subset of DNA samples, the user can apply UPIC to find marker combinations that maximize the genetic information obtained by a minimum or desirable number of markers. This allows a cost-effective planning of future experiments. We have developed Perl scripts to calculate all possible subset combinations of SSR markers, and determine based on unique patterns or alleles, which combinations can discriminate among all DNA samples included in a test. This makes UPIC an essential tool for optimizing resources when working with microsatellites. An example using real data from eight markers and 12 genotypes shows that UPIC detected groups of as few as three markers sufficient to discriminate all 12-DNA samples. Should markers for future experiments be chosen based only on polymorphism-information content (PIC), the necessary number of markers for discrimination of all samples cannot be determined. We also show that choosing markers using UPIC, an informative combination of four markers can provide similar information as using a combination of six markers (23 vs. 25 patterns, respectively), granting a more efficient planning of experiments. Perl scripts with documentation are also included to calculate the percentage of heterozygous loci on the DNA samples tested and to calculate three PIC values depending on the type of fertilization and allele frequency of the organism. The UPIC zip file contains 2 perl scripts, a README, and sample input and the resulting outputs. We would appreciate citation if you use them. As of 1 November, 2010, the zip file also contains an beta optimized script (upic_optimum_v1.1.20101101.pl) that produces a comma separated file, with all the markers that discriminate at least one line, which shows which lines have unique patterns. This allows you to select markers by score & line. Resources in this dataset:Resource Title: UPIC version 1.2. File Name: UPIC_v1.2.zip

Access & Use Information

Public: This dataset is intended for public access and use. License: Creative Commons CCZero

Downloads & Resources

Dates

Metadata Created Date March 30, 2024
Metadata Updated Date March 30, 2024

Metadata Source

Harvested from USDA JSON

Additional Metadata

Resource Type Dataset
Metadata Created Date March 30, 2024
Metadata Updated Date March 30, 2024
Publisher Agricultural Research Service
Maintainer
Identifier 10.15482/USDA.ADC/1529201
Data Last Modified 2024-02-13
Public Access Level public
Bureau Code 005:18
Metadata Context https://project-open-data.cio.gov/v1.1/schema/catalog.jsonld
Schema Version https://project-open-data.cio.gov/v1.1/schema
Catalog Describedby https://project-open-data.cio.gov/v1.1/schema/catalog.json
Harvest Object Id 8db650a1-74ff-4c8a-80bf-ac591cd5d331
Harvest Source Id d3fafa34-0cb9-48f1-ab1d-5b5fdc783806
Harvest Source Title USDA JSON
License https://creativecommons.org/publicdomain/zero/1.0/
Old Spatial {"type": "Polygon", "coordinates": -530.15625, -83.194895636616, -530.15625, 85.255070486924, -170.15625, 85.255070486924, -170.15625, -83.194895636616, -530.15625, -83.194895636616}
Program Code 005:040
Source Datajson Identifier True
Source Hash f1b4a337ea413c3b6f2e80f6ab10a1de0a897cf40131ced313b5437c6c7291c7
Source Schema Version 1.1
Spatial {"type": "Polygon", "coordinates": -530.15625, -83.194895636616, -530.15625, 85.255070486924, -170.15625, 85.255070486924, -170.15625, -83.194895636616, -530.15625, -83.194895636616}

Didn't find what you're looking for? Suggest a dataset here.